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Preface 


This text, like its previous three editions, is an introduction to communication sys- 
tems written at a level appropriate for advanced undergraduates and first-year grad- 
uate students in electrical or computer engineering. New features in this edition 
include the introduction of two other authors, Professors Rutledge and Crilly, to pro- 
vide additional expertise for topics such as optical links and spread spectrum. 

An initial study of signal transmission and the inherent limitations of physical 
systems establishes unifying concepts of communication. Attention is then given to 
analog communication systems, random signals and noise, digital systems, and 
information theory. However, as indicated in the table of contents, instructors may 
choose to skip over topics that have already been or will be covered elsewhere. 

Mathematical techniques and models necessarily play an important role 
throughout the book, but always in the engineering context as means to an end. 
Numerous applications have been incorporated for their practical significance and 
as illustrations of concepts and design strategies. Some hardware considerations are 
also included to justify various communication methods, to stimulate interest, and to 
bring out connections with other branches of the field. 





PREREQUISITE BACKGROUND 


The assumed background is equivalent to the first two or three years of an electrical 
or computer engineering curriculum. Essential prerequisites are differential equa- 
tons, steady-state and transient circuit analysis, and a first course in electronics. 
Students should also have some familiarity with operational amplifiers, digital logic, 
and matrix notation. Helpful but not required are prior exposure to linear systems 
analysis, Fourier transforms, and probability theory. 








CONTENTS AND ORGANIZATION 


A distinctive feature of this edition is the position and treatment of probability, ran- 
dom signals, and noise. These topics are located after the discussion of analog sys- 
tems without noise. Other distinctive features are the new chapter on spread spec- 
trum systems and the revised chapter on information and detection theory near the 
end of the book. The specific topics are listed in the table of contents and discussed 
further in Sect. 1.4. 

Following an updated introductory chapter, this text has two chapters dealing 
with basic tools. These tools are then applied in the next four chapters to analog com- 
munication systems, including sampling and pulse modulation. Probability, random 
signals, and noise are introduced in the following three chapters and applied to ana- 
log systems. An appendix separately covers circuit and system noise. The remaining 


xi 





xii 


Preface 


six chapters are devoted to digital communication and information theory, which 
require some knowledge of random signals and include coded pulse modulation. 

All sixteen chapters can be presented in a year-long undergraduate course with 
minimum prerequisites. Or a one-term undergraduate course on analog communica- 
tion might consist of material in the first seven chapters. If linear systems and prob- 
ability theory are covered in prerequisite courses, then most of the last eight chap- 
ters can be included in a one-term senior/graduate course devoted primarily to 
digital communication. 

The modular chapter structure allows considerable latitude for other formats. 
As a guide to topic selection, the table of contents indicates the minimum prerequi- 


sites for each chapter section. Optional topics within chapters are marked by the 
symbol x. 


INSTRUCTIONAL AIDS 


Each chapter after the first one includes a list of instructional objectives to guide stu- 
dent study. Subsequent chapters also contain several examples and exercises. The 
exercises are designed to help students master their grasp of new material presented 
in the text, and exercise solutions are given at the back. The examples have been 
chosen to illuminate concepts and techniques that students often find troublesome. 

Problems at the ends of chapters are numbered by text section. They range from 
basic manipulations and computations to more advanced analysis and design tasks. 
A manual of problem solutions is available to instructors from the publisher. 


Several typographical devices have been incorporated to serve as aids for stu- 
dents. Specifically, 


* Technical terms are printed in boldface type when they first appear. 


* Important concepts and theorems that do not involve equations are printed 
inside boxes. i 


e Asterisks (*) after problem numbers indicate that answers are provided at the 
back of the book. 


* The symbol ¢ identifies the more challenging problems. 


Tables at the back of the book include transform pairs, mathematical relations, 
and probability functions for convenient reference. An annotated bibliography is 
also provided at the back in the form of a supplementary reading list. 

Communication system engineers use many abbreviations, so the index lists 
common abbreviations and their meanings. Thus, the index additionally serves as a 
guide to many abbreviations in communications. 
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2 CHAPTER 1 e Introduction 


A the Universe! By kingdoms, right wheel!” This prophetic phrase represents the first telegraph message 
on record. Samuel F. B. Morse sent it over a 16 km line in 1838. Thus a new era was born: the era of electri- 


cal communication. 


Now, over a century and a half later, communication engineering has advanced to the point that earthbound TV 
viewers watch astronauts working in space. Telephone, radio, and television are integral parts of modern life. Long- 
distance circuits span the globe carrying text, data, voice, and images. Computers talk to computers via interconti- 
nental networks, and control virtually every electrical appliance in our homes. Wireless personal communication 
devices keep us connected wherever we go. Certainly great strides have been made since the days of Morse. 
Equally certain, coming decades will usher in many new achievements of communication engineering. 

This textbook introduces electrical communication systems, including analysis methods, design principles, and hard- 
ware considerations. VVe begin with a descriptive overview that establishes a perspective for the chapters that follow. 














1.1 ELEMENTS AND LIMITATIONS 
OF COMMUNICATION SYSTEMS 


A communication system conveys information from its source to a destination some 
distance away. There are so many different applications of communication systems 
that we cannot attempt to cover every type. Nor can we discuss in detail all the indi- 
vidual parts that make up a specific system. A typical system involves numerous 
components that run the gamut of electrical engineering—circuits, electronics, elec- 
tromagnetics, signal processing, microprocessors, and communication networks, to 
name a few of the relevant fields. Moreover, a piece-by-piece treatment would 
obscure the essential point that a communication system is an integrated whole that 
really does exceed the sum of its parts. 

We therefore approach the subject from a more general viewpoint. Recognizing 
that all communication systems have the same basic function of information trans- 
fer, we'll seek out and isolate the principles and problems of conveying information 
in electrical form. These will be examined in sufficient depth to develop analysis 
and design methods suited to a wide range of applications. In short, this text is con- 
cerned with communication systems as systems. 


Information, Messages, and Signals 


Clearly, the concept of information is central to communication. But information is 
a loaded word, implying semantic and philosophical notions that defy precise defi- 
nition. We avoid these difficulties by dealing instead with the message, defined as 
the physical manifestation of information as produced by the source. Whatever form 
the message takes, the goal of a communication system is to reproduce at the desti- 
nation an acceptable replica of the source message. 

There are many kinds of information sources, including machines as well as 
people, and messages appear in various forms. Nonetheless, we can identify two 
distinct message categories, analog and digital. This distinction, in turn, determines 
the criterion for successful communication. 
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Figure 1.1-1 Communication system with input and output transducers. 


An analog message is a physical quantity that varies with time, usually in a 
smooth and continuous fashion. Examples of analog messages are the acoustic pres- 
sure produced when you speak, the angular position of an aircraft gyro, or the light 
intensity at some point in a television image. Since the information resides in a time- 
varying waveform, an analog communication system should deliver this waveform 
with a specified degree of fidelity. 

A digital message is an ordered sequence of symbols selected from a finite set 
of discrete elements. Examples of digital messages are the letters printed on this 
page, a listing of hourly temperature readings, or the keys you press on a computer 
keyboard. Since the information resides in discrete symbols, a digital communica- 
tion system should deliver these symbols with a specified degree of accuracy in a 
specified amount of time. 

Whether analog or digital, few message sources are inherently electrical. Con- 
sequently, most communication systems have input and output transducers as 
shown in Fig. 1.1-1. The input transducer converts the message to an electrical sig- 
nal, say a voltage or current, and another transducer at the destination converts the 
output signal to the desired message form. For instance, the transducers in a voice 
communication system could be a microphone at the input and a loudspeaker at the 
output. We'll assume hereafter that suitable transducers exist, and we'll concentrate 
primarily on the task of signal transmission. In this context the terms signal and 
message will be used interchangeably since the signal, like the message, is a physi- 
cal embodiment of information. 


Elements of a Communication System 


Figure 1.1—2 depicts the elements of a communication system, omitting transducers 
but including unwanted contaminations. There are three essential parts of any com- 
munication system, the transmitter, transmission channel, and receiver. Each part 
plays a particular role in signal transmission, as follows. 

The transmitter processes the input signal to produce a transmitted signal 
suited to the characteristics of the transmission channel. Signal processing for trans- 
mission almost always involves modulation and may also include coding. 

The transmission channel is the electrical medium that bridges the distance 
from source to destination. It may be a pair of wires, a coaxial cable, Gr a radio wave 
or laser beam. Every channel introduces some amount of transmission loss or atten- 
uation, so the signal power progressively decreases with increasing distance. 
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Figure 1.1-2 Elements of a communication system. 


The receiver operates on the output signal from the channel in preparation for 
delivery to the transducer at the destination. Receiver operations include amplifica- 
tion to compensate for transmission loss, and demodulation and decoding to 
reverse the signal-processing performed at the transmitter. Filtering is another 
important function at the receiver, for reasons discussed next. 

Various unwanted undesirable effects crop up in the course of signal transmis- 
sion. Attenuation is undesirable since it reduces signal strength at the receiver. 
More serious, however, are distortion, interference, and noise, which appear as alter- 
ations of the signal shape. Although such contaminations may occur at any point, 
the standard convention is to blame them entirely on the channel, treating the trans- 
mitter and receiver as being ideal. Figure 1.1—2 reflects this convention. 

Distortion is waveform perturbation caused by imperfect response of the sys- 
tem to the desired signal itself. Unlike noise and interference, distortion disappears 
when the signal is turned off. If the channel has a linear but distorting response, then 
distortion may be corrected, or at least reduced, with the help of special filters called 
equalizers. 

Interference is contamination by extraneous signals from human sources— 
other transmitters, power lines and machinery, switching circuits, and so on. Interfer- 
ence occurs most often in radio systems whose receiving antennas usually intercept 
several signals at the same time. Radio-frequency interference (RFI) also appears in 
cable systems if the transmission wires or receiver circuitry pick up signals radiated 
from nearby sources. Appropriate filtering removes interference to the extent that the 
interfering signals occupy different frequency bands than the desired signal. 

Noise refers to random and unpredictable electrical signals produced by natural 
processes both internal and external to the system. When such random variations are 
superimposed on an information-bearing signal, the message may be partially cor- 
rupted or totally obliterated. Filtering reduces noise contamination, but there 
inevitably remains some amount of noise that cannot be eliminated. This noise con- 
stitutes one of the fundamental system limitations. 

Finally, it should be noted that Fig. 1.1-2 represents one-way or simplex (SX) 
transmission. Two-way communication, of course, requires a transmitter and 
receiver at each end. A full-duplex (FDX) system has a channel that allows simulta- 
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neous transmission in both directions. A half-duplex (HDX) system allows trans- 
mission in either direction but not at the same time. 


Fundamental Limitations 


An engineer faces two general kinds of constraints when designing a communica- 
tion system. On the one hand are the technological problems, including such 
diverse considerations as hardware availability, economic factors, federal regula- 
tions, and so on. These are problems of feasibility that can be solved in theory, even 
though perfect solutions may not be practical. On the other hand are the fundamen- 
tal physical limitations, the laws of nature as they pertain to the task in question. 
These limitations ultimately dictate what can or cannot be accomplished, irrespec- 
tive of the technological problems. The fundamental limitations of information 
transmission by electrical means are bandwidth and noise. 

The concept of bandwidth applies to both signals and systems as a measure of 
speed. When a signal changes rapidly with time, its frequency content, or spec- 
trum, extends over a wide range and we say that the signal has a large bandwidth. 
Similarly, the ability of a system to follow signal variations is reflected in its usable 
frequency response or transmission bandwidth. Now all electrical systems contain 
energy-storage elements, and stored energy cannot be changed instantaneously. 
Consequently, every communication system has a finite bandwidth B that limits the 
rate of signal variations. 

Communication under real-time conditions requires sufficient transmission 
bandwidth to accommodate the signal spectrum; otherwise, severe distortion will 
result. Thus, for example, a bandwidth of several megahertz is needed for a TV 
video signal, while the much slower variations of a voice signal fit into B = 3 kHz. 
For a digital signal with r symbols per second, the bandwidth must be B = 7/2. In 
the case of information transmission without a real-time constraint, the available 
bandwidth determines the maximum signal speed. The time required to transmit a 
given amount of information is therefore inversely proportional to B. 

Noise imposes a second limitation on information transmission. Why is noise 
unavoidable? Rather curiously, the answer comes from kinetic theory. At any tem- 
perature above absolute zero, thermal energy causes microscopic particles to exhibit 
random motion. The random motion of charged particles such as electrons generates 
random currents or voltages called thermal noise. There are also other types of 
noise, but thermal noise appears in every communication system. 

We measure noise relative to an information signal in terms of the signal-to- 
noise power ratio S/N. Thermal noise power is ordinarily quite small, and S/N can 
be so large that the noise goes unnoticed. At lower values of S/N, however, noise 
degrades fidelity in analog communication and produces errors in digital commu- 
nication. These problems become most severe on long-distance links when the 
transmission loss reduces the received signal power down to the noise level. Ampli- 


fication at the receiver is then to no avail, because the noise will be amplified along 
with the signal. 
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Taking both limitations into account, Shannon (1948)! stated that the rate of 
information transmission cannot exceed the channel capacity. 


C = Blog (1 + S/N) 


This relationship, known as the Hartley-Shannon law, sets an upper limit on the 
performance of a communication system with a given bandwidth and signal-to- 
noise ratio. 


1.2 MODULATION AND CODING 


Modulation and coding are operations performed at the transmitter to achieve effi- 
cient and reliable information transmission. So important are these operations that 
they deserve further consideration here. Subsequently, we'll devote several chapters 
to modulating and coding techniques. 


Modulation Methods 


Modulation involves two waveforms: a modulating signal that represents the mes- 
sage, and a carrier wave that suits the particular application. A modulator systemat- 
ically alters the carrier wave in correspondence with the variations of the modulating 
signal. The resulting modulated wave thereby "carries" the message information. We 
generally require that modulation be a reversible operation, so the message can be 
retrieved by the complementary process of demodulation. 

Figure 1.2—1 depicts a portion of an analog modulating signal (part a) and the 
corresponding modulated waveform obtained by varying the amplitude of a sinu- 
soidal carrier wave (part 5). This is the familiar amplitude modulation (AM) used 
for radio broadcasting and other applications. À message may also be impressed 
on a sinusoidal carrier by frequency modulation (FM) or phase modulation (PM). 
All methods for sinusoidal carrier modulation are grouped under the heading of 
continuous-wave (CW) modulation. 

Incidentally, you act as a CW modulator whenever you speak. The transmission 
of voice through air is accomplished by generating carrier tones in the vocal cords 
and modulating these tones with muscular actions of the oral cavity. Thus, what the 
ear hears as speech is a modulated acoustic wave similar to an AM signal. 

Most long-distance transmission systems employ CW modulation with a carrier 
frequency much higher than the highest frequency component of the modulating sig- 
nal. The spectrum of the modulated signal then consists of a band of frequency com- 
ponents clustered around the carrier frequency. Under these conditions, we say that 
CW modulation produces frequency translation. In AM broadcasting, for example, 
the message spectrum typically runs from 100 Hz to 5 KHz; if the carner frequency is 
600 kHz, then the spectrum of the modulated carrier covers 595-605 KHz. 


TReferences are indicated in this fashion throughout the text. Complete citations are listed alphabeti- 
cally by author in the References at the end of the book. 
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Figure 1.2-1 (a) Modulating signal; (b) sinusoidal carrier with amplitude modulation; 
(c) pulse-train carrier with amplitude modulation. 


Another modulation method, called pulse modulation, has a periodic train of 
short pulses as the carrier wave. Figure 1.2-1c shows a waveform with pulse ampli- 
tude modulation (PAM). Notice that this PAM wave consists of short samples 
extracted from the analog signal at the top of the figure. Sampling is an important 
signal-processing technique and, subject to certain conditions, it's possible to 
reconstruct an entire waveform from periodic samples. 

But pulse modulation by itself does not produce the frequency translation 
needed for efficient signal transmission. Some transmitters therefore combine pulse 
and CW modulation. Other modulation techniques, described shortly, combine 
pulse modulation with coding. 


Modulation Benefits and Applications 


The primary purpose of modulation in a communication system is to generate a mod- 
ulated signal suited to the characteristics of the transmission channel. Actually, there 
are several practical benefits and applications of modulation briefly discussed below. 


Modulation for Efficient Transmission Signal transmission over appreciable distance 
always involves a traveling electromagnetic wave, with or without a guiding medium. 
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The efficiency of any particular transmission method depends upon the frequency of 
the signal being transmitted. By exploiting the frequency-translation property of CW 
modulation, message information can be impressed on a carrier whose frequency has 
been selected for the desired transmission method. 

As a case in point, efficient line-of-sight ratio propagation requires antennas 
whose physical dimensions are at least 1/10 of the signal's wavelength. Unmodulated 
transmission of an audio signal containing frequency components down to 100 Hz 
would thus call for antennas some 300 km long. Modulated transmission at 100 MHz, 
as in FM broadcasting, allows a practical antenna size of about one meter. At frequen- 
cies below 100 MHz, other propagation modes have better efficiency with reasonable 
antenna sizes. Tomasi (1994, Chap. 10) gives a compact treatment of radio propaga- 
tion and antennas. 

For reference purposes, Fig. 1.2-2 shows those portions of the electromagnetic 
spectrum suited to signal transmission. The figure includes the free-space wave- 
length, frequency-band designations, and typical transmission media and propaga- 
tion modes. Also indicated are representative applications authorized by the U.S. 
Federal Communications Commission. 


Modulation to Overcome Hardware Limitations The design of a communication 
System may be constrained by the cost and availability of hardware, hardware whose 
performance often depends upon the frequencies involved. Modulation permits the 
designer to place a signal in some frequency range that avoids hardware limitations. 
A particular concern along this line is the question of fractional bandwidth, 
defined as absolute bandwidth divided by the center frequency. Hardware costs and 
complications are minimized if the fractional bandwidth is kept within 1—10 per- 
cent. Fractional-bandwidth considerations account for the fact that modulation units 
are found in receivers as well as in transmitters. 

It likewise follows that signals with large bandwidth should be modulated on 
high-frequency carriers. Since information rate is proportional to bandwidth, 
according to the Hartley-Shannon law, we conclude that a high information rate 
requires a high carrier frequency. For instance, a 5 GHz microwave system can 
accommodate 10,000 times as much information in a given time interval as a 500 
kHz radio channel. Going even higher in the electromagnetic spectrum, one optical 
laser beam has a bandwidth potential equivalent to 10 million TV channels. 


Modulation to Reduce Noise and Interference A brute-force method for combating 
noise and interference is to increase tbe signal power until it overwhelms the con- 
taminations. But increasing power is costly and may damage equipment. (One of the 
early transatlantic cables was apparently destroyed by high-voltage rupture in an 
effort to obtain a usable received signal.) Fortunately, FM and certain other types of 
modulation have the valuable property of suppressing both noise and interference. 
This property is called wideband noise reduction because it requires the trans- 
mission bandwidth to be much greater than the bandwidth of the modulating signal. 
Wideband modulation thus allows the designer to exchange increased bandwidth for 
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decreased signal power, a trade-off implied by the Hartley-Shannon law. Note that a 
higher carrier frequency may be needed to accommodate wideband modulation. 


Modulation for Frequency Assignment When you tune a radio or television set to a 
particular station, you are selecting one of the many signals being received at that 
time. Since each station has a different assigned carrier frequency, the desired signal 
can be separated from the others by filtering. Were it not for modulation, only one 
station could broadcast in a given area; otherwise, two or more broadcasting stations 
would create a hopeless jumble of interference. 


Modulation for Multiplexing Multiplexing is the process of combining several sig- 
nals for simultaneous transmission on one channel. Frequency-division multiplex- 
ing (FDM) uses CW modulation to put each signal on a different carrier frequency, 
and a bank of filters separates the signals at the destination. Time-division multi- 
plexing (TDM) uses pulse modulation to put samples of different signals in nonover- 
lapping time slots. Back in Fig. 1.2-1c, for instance, the gaps between pulses could 
be filled with samples from other signals. A switching circuit at the destination then 
separates the samples for signal reconstruction. Applications of multiplexing include 
FM stereophonic broadcasting, cable TV, and long-distance telephone. 

A variation of multiplexing is multiple access (MA). Whereas multiplexing 
involves a fixed assignment of the common communications resource (such as fre- 
quency spectrum) at the local level, MA involves the remote sharing of the resource. 
For example, code-division multiple access (CDMA) assigns a unique code to each 
digital cellular user, and the individual transmissions are separated by correlation 
between the codes of the desired transmitting and receiving parties. Since CDMA 
allows different users to share the same frequency band simultaneously, it provides 
another way of increasing communication efficiency. 


Coding Methods and Benefits 


We’ve described modulation as a signal-processing operation for effective transmis- 
sion. Coding is a symbol-processing operation for improved communication when 
the information is digital or can be approximated in the form of discrete symbols. 
Both coding and modulation may be necessary for reliable long-distance digital 
transmission. 

"The operation of encoding transforms a digital message into a new sequence of 
symbols. Decoding converts an encoded sequence back to the original message 
with, perhaps, a few errors caused by transmission contaminations. Consider a com- 
puter or other digital source having M >> 2 symbols. Uncoded transmission of a 
message from this source would require M different waveforms, one for each sym- 
bol. Alternatively, each symbol could be represented by a binary codeword con- 
sisting of K binary digits. Since there are 2 possible codewords made up of K 
binary digits, we need K = log, M digits per codeword to encode M source symbols. 
If the source produces r symbols per second, the binary code will have Kr digits per 
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second and the transmission bandwidth requirement is K times the bandwidth of an 
uncoded signal. 

In exchange for increased bandwidth, binary encoding of M-ary source symbols 
offers two advantages. First, less complicated hardware is needed to handle a binary 
signal composed of just two different waveforms. Second, contaminating noise has 
less effect on a binary signal than it does on a signal composed of M different wave- 
forms, so there will be fewer errors caused by the noise. Hence, this coding method 
is essentially a digital technique for wideband noise reduction. 

Channel coding is a technique used to introduce controlled redundancy to fur- 
ther improve the performance reliability in a noisy channel. Error-control coding 
goes further in the direction of wideband noise reduction. By appending extra check 
digits to each binary codeword, we can detect, or even correct, most of the errors 
that do occur. Error-control coding increases both bandwidth and hardware com- 
plexity, but it pays off in terms of nearly error-free digital communication despite a 
low signal-to-noise ratio. 

Now, let's examine the other fundamental system limitation: bandwidth. Many 
communication systems rely on the telephone network for transmission. Since the 
bandwidth of the transmission system is limited by decades-old design specifica- 
tions, in order to increase the data rate, the signal bandwidth must be reduced. High- 
speed modems (data modulator/demodulators) are one application requiring such 
data reduction. Source-coding techniques take advantage of the statistical knowl- 
edge of the source signal to enable efficient encoding. Thus, source coding can be 
viewed as the dual of channel coding in that it reduces redundancy to achieve the 
desired efficiency. 

Finally, the benefits of digital coding can be incorporated in analog communi- 
cation with the help of an analog-to-digital conversion method such as pulse-code- 
modulation (PCM). A PCM signal is generated by sampling the analog message, 
digitizing (quantizing) the sample values, and encoding the sequence of digitized 
samples. In view of the reliability, versatility, and efficiency of digital transmission, 
PCM has become an important method for analog communication. Furthermore, 
when coupled with high-speed microprocessors, PCM makes it possible to substi- 
tute digital signal processing for analog operations. 





1.3 HISTORICAL PERSPECTIVE AND 
SOCIETAL IMPACT 


In our daily lives we often take for granted the powerful technologies that allow us 
to communicate, nearly instantaneously, with people around the world. Many of us 
now have multiple phone numbers to handle our home and business telephones, fac- 
simile machines, modems, and wireless personal communication devices. We send 
text, video, and music through electronic mail, and we "surf the Net" for informa- 
tion and entertainment. We have more television stations than we know what to do 
with, and “smart electronics" allow our household appliances to keep us posted on 
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their health. It is hard to believe that most of these technologies were developed in 
the past 50 years. 


Historical Perspective 


The organization of this text is dictated by pedagogical considerations and does not 
necessarily reflect the evolution of communication systems. To provide at least 
some historical perspective, a chronological outline of electrical communication is 
presented in Table 1.3—1. The table lists key inventions, scientific discoveries, 
important papers, and the names associated with these events. 











Table 1.3-1 A chronology of electrical communication 
Year Event 
1800-1837 Preliminary developments Volta discovers the primary battery; the mathematical 


treatises by Fourier, Cauchy, and Laplace; experiments on electricity and magnetism 
by Oersted, Ampere, Faraday, and Henry; Ohm’s law (1826); early telegraph systems 
by Gauss, Weber, and Wheatstone. 


1838-1866 Telegraphy Morse perfects his system; Steinheil finds that the earth can be used for 
a current path; commercial service initiated (1844); multiplexing techniques devised; 
William Thomson (Lord Kelvin) calculates the pulse response of a telegraph line 
(1855); transatlantic cables installed by Cyrus Field and associates. 


1845 Kirchhoff’s circuit laws enunciated. 
1864 Maxwell’s equations predict electromagnetic radiation. 
1876-1899 Telephony Acoustic transducer perfected by Alexander Graham Bell, after earlier 


attempts by Reis; first telephone exchange, in New Haven, with eight lines (1878); 
Edison’s carbon-button transducer; cable circuits introduced; Strowger devises auto- 
matic step-by-step switching (1887); the theory of cable loading by Heaviside, Pupin, 
and Campbell. 

1887-1907 Wireless telegraphy Heinrich Hertz verifies Maxell's theory; demonstrations by 
Marconi and Popov; Marconi patents a complete wireless telegraph system (1897); 
the theory of tuning circuits developed by Sir Oliver Lodge; commercial service 
begins, including ship-to-shore and transatlantic systems. 


1892-1899 Oliver Heaviside's publications on operational calculus, circuits, and electromagnetics. 


1904—1920 Communication electronics Lee De Forest invents the Audion (triode) based on 
Fleming's diode; basic filter types devised by G. A. Campbell and others; experi- 
ments with AM radio broadcasting; transcontinental telephone line witb electronic 
repeaters completed by the Bell System (1915); multiplexed carrier telephony intro- 
duced; E. H. Armstrong perfects the superheterodyne radio receiver (1918); first 
commercial broadcasting station, KDKA, Pittsburgh. 


1920-1928 Transmission theory Landmark papers on the theory of signal transmission and noise 
by J. R. Carson, H. Nyquist, J. B. Johnson, and R. V. L. Hartley. 

1923-1938 Television Mechanical image-formation system demonstrated by Baird and Jenkins; 
theoretical analysis of bandwidth requirements; Farnsworth and Zworykin propose 


electronic systems; vacuum cathode-ray tubes perfected by DuMont and others; field 
tests and experimental broadcasting begin. 


1927 Federal Communications Commission established. 


Table 1.3-1 
Year 

1931 

1934 

1936 

1937 
1938-1945 


1944—1947 


1948-1950 


1948-1951 
1950 
1953 
1955 
1956 
1958 
1960 
1961 


1962 
1962-1966 


1963 
1964 
1965 
1966-1975 


1969 
1971 
1972 


1980 
1981 


1982 
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A chronology of electrical communication (continued) 
Event 

Teletypewriter service initiated. 

H. S. Black develops the negative-feedback amplifier. 
Armstrong’s paper states the case for FM radio. 

Alec Reeves conceives pulse-code modulation. 


World War IT Radar and microwave systems developed; FM used extensively for 
military communications; improved electronics, hardware, and theory in all areas. 


Statistical communication theory Rice develops a mathematical representation of 


noise; Weiner, Kolmogoroff, and Kotel’ nikov apply statistical methods to signal 
detection. 


Information theory and coding C. E. Shannon publishes the founding papers of 
information theory; Hamming and Golay devise error-correcting codes. 
Transistor devices invented by Bardeen, Brattain, and Shockley. 

Time-division multiplexing applied to telephony. 

Color TV standards established in the United States. 

J. R. Pierce proposes satellite communication systems. 

First transoceanic telephone cable (36 voice channels). 

Long-distance data transmission system developed for military purposes. 


Maiman demonstrates the first laser. 


Integrated circuits go into commercial production; stereo FM broadcasts begin in the 
U.S. 


Satellite communication begins with Telstar I. 


High-speed digital communication Data transmission service offered commercially; 
Touch-Tone telephone service introduced; wideband channels designed for digital 
signaling; pulse-code modulation proves feasible for voice and TV transmission; 
major breakthroughs in theory and implementation of digital transmission, including 
error-control coding methods by Viterbi and others, and the development of adaptive 
equalization by Lucky and coworkers. 


Solid-state microwave oscillators perfected by Gunn. 
Fully electronic telephone switching system (No. 1 ESS) goes into service. 
Mariner IV transmits pictures from Mars to Earth. 


Wideband communication systems Cable TV systems; commercial satellite relay 
service becomes available; optical links using lasers and fiber optics. 


ARPANET created (precursor to Internet) 


Intel develops first single-chip microprocessor 


Motorola develops cellular telephone; first live TV broadcast across Atlantic ocean 
via satellite 


Compact disc developed by Philips and Sony 


FCC adopts rules creating commercial cellular telephone service; IBM PC is intro- 
duced (hard drives introduced two years later). 


AT&T agrees to divest 22 local service telephone companies; seven regional Bell 
system operating companies formed. 


(continued) 
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Table 1.3-1 A chronology of electrical communication (continued) 

Year Event 

1985 Fax machines widely available in offices. 

1988-1989 Installation of trans-Pacific and trans-Atlantic optical cables for light-wave commu- 
nications. 

1990-2000 Digital communication systems Digital signal processing and communication sys- 


tems in household appliances; digitally tuned receivers; direct-sequence spread spec- 
trum systems; integrated services digital networks (ISDNs); high-definition digital 
television (HDTV) standards developed; digital pagers; handheld computers; digital 
cellular. 

1994—1995 FCC raises $7.7 billion in auction of frequency spectrum for broadband personal 
communication devices 


1998 Digital television service launched in U.S. 


Several of the terms in the chronology have been mentioned already, while oth- 
ers will be described in later chapters when we discuss the impact and interrelation- 


ships of particular events. You may therefore find it helpful to refer back to this table 
from time to time. 


Societal Impact 


Our planet feels a little smaller in large part due to advances in communication. Mul- 
tiple sources constantly provide us with the latest news of world events, and savvy 
leaders make great use of this to shape opinions in their own countries and abroad. 
Communication technologies change how we do business, and once-powerful com- 
panies, unable to adapt, are disappearing. Cable and telecommunications industries 
split and merge at a dizzying pace, and the boundaries between their technologies and 
those of computer hardware and software companies are becoming blurred. We are 
able (and expected) to be connected 24 hours a day, seven days a week, which means 
that we may continue to receive work-related E-mail, phone calls, and faxes, even 
while on vacation at the beach or in an area once considered remote. 

These technology changes spur new public policy debates, chiefly over issues 
of personal privacy, information security, and copyright protection. New businesses 
taking advantage of the latest technologies appear at a faster rate than the laws and 
policies required to govern these issues. With so many computer systems connected 
to the Internet, malicious individuals can quickly spread computer viruses around 
the globe. Cellular phones are so pervasive that theaters and restaurants have created 
policies governing their use. For example, it was not so long ago that before a show 
an announcement would be made that smoking was not allowed in the auditorium. 
Now some theaters request that members of the audience turn off cell phones and 
beepers. State laws, municipal franchises, and public utility commissions must 
change to accommodate the telecommunications revolution. And the workforce 
must stay current with advances in technology via continuing education. 


1.4 Prospectus . 


With new technologies developing at an exponential rate, we cannot say for cer- 
tain what the world will be like in another 50 years. Nevertheless, a firm grounding 
in the basics of communication systems, creativity, commitment to ethical applica- 
tion of technology, and strong problem solving skills will equip the communications 
engineer with the capability to shape that future. 
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This text provides a comprehensive introduction to analog and digital communica- 
tions. A review of relevant background material precedes each major topic that is 
presented. Each chapter begins with an overview of the subjects covered and a list- 
ing of learning objectives. Throughout the text we rely heavily on mathematical 
models to cut to the heart of complex problems. Keep in mind, however, that such 
models must be combined with physical reasoning and engineering judgment. 

Chapters 2 and 3 deal with deterministic signals, emphasizing time-domain and 
frequency-domain analysis of signal transmission, distortion, and filtering. Chapters 
4 and 5 discuss the how and the why of various types of CW modulation. Particular 
topics include modulated waveforms, transmitters, and transmission bandwidth. 
Sampling and pulse modulation are introduced in Chapter 6, followed by analog 
modulation systems, including receivers, multiplexing systems, and television sys- 
tems in Chapter 7. Before a discussion of the impact of noise on CW modulation 
systems in Chapter 10, Chapters 8 and 9 apply probability theory and statistics to the 
representation of random signals and noise. 

Digital communication starts in Chapter 11 with baseband (unmodulated) 
transmission, so we can focus on the important concepts of digital signals and spec- 
tra, noise and errors, and synchronization. Chapter 12 then draws upon previous 
chapters for the study of coded pulse modulation, including PCM and digital multi- 
plexing systems. A short survey of error-control coding is presented in Chapter 13. 
Chapter 14 analyzes digital transmission systems with CW modulation, culminating 
in a performance comparison of various methods. An expanded presentation of 
spread spectrum systems is presented in this edition in Chapter 15. Finally, an intro- 
duction to information theory in Chapter 16 provides a retrospective view of digital 
communication and returns us to the Hartley-Shannon law. 

Each chapter contains several exercises designed to clarify and reinforce the 
concepts and analytic techniques. You should work these exercises as you come to 
them, checking your results with the answers provided at the back of the book. Also 
at the back you’ ll find tables containing handy summaries of important text material 
and mathematical relations pertinent to the exercises and to the problems at the end 
of each chapter. 

Although we mostly describe communication systems in terms of “black 
boxes" with specified properties, we'll occasionally lift the lid to look at electronic 
circuits that carry out particular operations. Such digressions are intended to be 
illustrative rather than a comprehensive treatment of communication electronics. 
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Besides discussions of electronics, certain optional or more advanced topics are 
interspersed in various chapters and identified by the symbol *. These topics may 
be omitted without loss of continuity. Other optional material of a supplementary 
nature is contained in the appendix. 

Two types of references have been included. Books and papers cited within 
chapters provide further information about specific items. Additional references are 
collected in a supplementary reading list that serves as an annotated bibliography 
for those who wish to pursue subjects 1n greater depth. 

Finally, as you have probably observed, communications engineers use many 
abbreviations and acronyms. Most abbreviations defined in this book are also listed 
in the index, to which you can refer if you happen to forget a definition. 
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lectrical communication signals are time-varying quantities such as voltage or current. Although a signal physically 
exists in the time domain, we can also represent it in the frequency domain where we view the signal as com 
sisting of sinusoidal components at various frequencies. This frequency-domain description is called the spectrum. 

Spectral analysis, using the Fourier series and transform, is one of the fundamental methods of communication 
engineering. It allows us to treat entire classes of signals that have similar properties in the frequency domain, rather 
than getting bogged down in detailed time-domain analysis of individual signals. Furthermore, when coupled with 
the frequency-response characteristics of filters and other system components, the spectral approach provides valu- 
able insight for design work. 

This chapter therefore is devoted to signals and spectral analysis, giving special attention to the frequency- 
domain interpretation of signal properties. We'll examine line spectra based on the Fourier series expansion of peri- 
odic signals, and continuous spectra based on the Fourier transform of nonperiodic signals. These two types of spec- 
ira will ultimately be merged with the help of the impulse concept. 

As the first step in spectral analysis we must write equations representing signals as functions of time. But such 
equations are only mathematical models of the real world, and imperfect models at that. In fact, a completely faithful 
description: of the simplest physical signal would be quite complicated and impractical for engineering purposes. 
Hence, we try to devise models that represent with minimum complexity the significant properties of physical signals. 
The study of many different signal models provides us with the background needed to choose appropriate models for 
specific applications. In many cases, the models will apply only to particular classes of signals. Throughout the chap- 
ter the major classifications of signals will be highlighted for their special properties. 


OBJECTIVES 








After studying this chapter and working the exercises, you should be able to do each of the following: 


1. Sketch and label the one-sided or two-sided line spectrum of a signal consisting of a sum of sinusoids (Sect. 2.1). 
2. Calculate the average value, average power, and total energy of a simple signal (Sects. 2.1 and 2.2). 


3. | Write the expressions for the exponential Fourier series and coefficients, the trigonometric Fourier series, and the 
direct and inverse Fourier transform (Sects. 2.1 and 2.2). 


4. Identify the time-domain properties of a signal from its frequency-domain representation and vice versa (Sect. 2.2). 


5. Sketch and label the spectrum of a rectangular pulse train, a single rectangular pulse, or a sinc pulse (Sects. 2.1 
and 2.2). 


6. State and apply Parseval's power theorem and Rayleigh's energy theorem (Sects. 2.1 and 2.2). 


7. State the following transform theorems: superposition, time delay, scale change, frequency translation and modu- 
Jation, differentiation and integration (Sect. 2.3). 


8. Use transform theorems to find and sketch the spectrum of a signal defined by time-domain operations (Sect. 2.3). 


Set up the convolution integral and simplify it as much as possible when one of the functions is a rectangular pulse 
(Sect. 2.4). 


10. State and apply the convolution theorems (Sect. 2.4). 
11. Evaluate or otherwise simplify expressions containing impulses (Sect. 2.5). 


12. Find the spectrum of a signal consisting of constants, steps, impulses, sinusoids, and/or rectangular and triangular 
functions (Sect. 2.5). 


2.1 Line Spectra and Fourier Series 


2.1 LINE SPECTRA AND FOURIER SERIES 


This section introduces and interprets the frequency domain in terms of rotating 
phasors. We'll begin with the line spectrum of a sinusoidal signal. Then we'll invoke 


the Fourier series expansion to obtain the line spectrum of any periodic signal that 
has finite average power. 


Phasors and Line Spectra 


Consider the familiar sinusoidal or ac (alternating-current) waveform v(t) plotted in 


Fig. 2.1-1. By convention, we express sinusoids in terms of the cosine function and 
write 


v(t) = A cos (wt  $) 11] 


where A is the peak value or amplitude and wọ is the radian frequency. The phase 
angle ¢ represents the fact that the peak has been shifted away from the time origin 
and occurs at t = — $/«. Equation (1) implies that v(t) repeats itself for all time, 


with repetition period 7, = 27/wọ. The reciprocal of the period equals the cyclical 
frequency 


A | Wo 


P e E [2] 
fo Ty 2r 
measured in cycles per second or hertz. 
Obyiously, no real signal goes on forever, but Eq. (1) could be a reasonable 
model for a sinusoidal waveform that lasts a long time compared to the period. In 
particular, ac steady-state circuit analysis depends upon the assumption of an eternal 


sinusoid—usually represented by a complex exponential or phasor. Phasors also 
play a major role in the spectral analysis. 


The phasor representation of a sinusoidal signal comes from Euler’s theorem 
D 


e^? = cos 0. + j sin 0 [3] 














Figure 2.1-1 A sinusoidal waveform v(t) = A cos (wot + œ). 
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where 2 N/—1 and 6 is an arbitrary angle. If we let 0 = wot + $, we can write 
any sinusoid as the real part of a complex exponential, namely 


A cos (wt + p) = A Re [eM'*9)] [4] 
= Re [Ae /$g 9v] 


This 1s called a phasor representation because the term inside the brackets may be 
viewed as a rotating vector in a complex plane whose axes are tbe real and imagi- 
nary parts, as Fig. 2.1—2a illustrates. The phasor has length A, rotates counterclock- 
wise at a rate of fọ revolutions per second, and at time t = 0 makes an angle @ with 
respect to the positive real axis. The projection of the phasor on the real axis equals 
the sinusoid in Eq. (4). 

Now observe that only three parameters completely specify a phasor: amplitude, 
phase angle, and rotational frequency. To describe the same phasor in the frequency 
domain, we must associate the corresponding amplitude and phase with tbe particular 
frequency fo. Hence, a suitable frequency-domain description would be the line spec- 
trum in Fig. 2.1-2b, which consists of two plots: amplitude versus frequency and 
phase versus frequency. While this figure appears simple to the point of being trivial, 
it does have great conceptual value when extended to more complicated signals. But 
before taking that step, four conventions regarding line spectra should be stated. 


1. Inallour spectral drawings the independent variable will be cyclical frequency 
f hertz, rather than radian frequency w, and any specific frequency such as fo 
will be identified by a subscript. (We'll still use w with or without subscripts as 
a shorthand notation for 2af since that combination occurs so often.) 


2. Phase angles will be measured with respect to cosine waves or, equivalently, 
with respect to the positive real axis of the phasor diagram. Hence, sine waves 
need to be converted to cosines via the identity 


sin wt = cos (wt — 90°) [5] 





Amplitude 
L— 
— 








4 
> 0 fo 
& i a 
M i wg t+ o i £a 
: NOME X f 
Real axis A COS (Wot + $) 0 ho 
a) (b) 


Figure 2.1-2 Representations of A cos (mof + 4). (a) Phasor diagram; [b] line spectrum. 
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Uo 


' We regard amplitude as always being a positive quantity. When negative signs 
appear, they must be absorbed in the phase using 


—A cos wt = A cos (wt + 180°) [6] 


It does not matter whether you take + 180° or — 180? since the phasor ends up 
in the same place either way. 


4. Phase angles usually are expressed in degrees even though other angles such as 
wt are inherently in radians. No confusion should result from this mixed nota- 
tion since angles expressed in degrees will always carry the appropriate symbol. 


To illustrate these conventions and to carry further the idea of line spectrum, 
consider the signal 


w(t) = 7 — 10 cos (40v: — 60°) + 4 sin 12071 


which is sketched in Fig. 2.1—3a. Converting the constant term to a zero frequency or 
dc (direct-current) component and applying Eqs. (5) and (6) gives the sum of cosines 


w(t) = 7 cos 2z0t + 10 cos (2720t + 120?) + 4 cos (277601 — 90°) 


whose spectrum is shown in Fig. 2.1-3b. 

Drawings like Fig. 2.1—35, called one-sided or positive-frequency line spectra, 
can be constructed for any linear combination of sinusoids. But another spectral rep- 
resentation turns out to be more valuable, even though it involves negative frequen- 
cies. We obtain this representation from Eq. (4) by recalling that Re[z] = 3(z + z*), 
where z is any complex quantity with complex conjugate z*. Hence, ifz = Ae/9e/"v 
then z* = Ae 9e J^ and Eq. (4) becomes 


Å od Wie sous 
A cos (wt + 6) = sore e IRE 17] 


so we now have a pair of conjugate phasors. 

The corresponding phasor diagram and line spectrum are shown in Fig. 2.1-4. 
The phasor diagram consists of two phasors with equal lengths but opposite angles 
and directions of rotation. The phasor sum always falls along the real axis to yield 
A cos (@ot + $). The line of spectrum is two-sided since it must include negative 
frequencies to allow for the opposite rotational directions, and one-half of the origi- 
nal amplitude is associated with each of the two frequencies +f. The amplitude 
spectrum has even symmetry while the phase spectrum has odd symmetry because 
we are dealing with conjugate phasors. This symmetry appears more vividly in 
Fig. 2.1-5, which is the two-sided version of Fig. 2.1—3b. 

It should be emphasized that these line spectra, one-sided or two-sided, are just 
pictorial ways of representing sinusoidal or phasor time functions. A single line in the 
one-sided spectrum represents a real cosine wave, whereas a single line in the two- 
sided spectrum represents a complex exponential and the conjugate term must be 
added to get a real cosine wave. Thus, whenever we speak of some frequency interval 
such as f, to f; in a two-sided spectrum, we should also include the corresponding 
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(a) Conjugate phasors; (b} two-sided spectrum. 
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Figure 2.1-5 


negative-frequency interval —f, to —f,. A simple notation for specifying both inter- 
vals is f, = |f| € f. 
Finally, note that 





Putting this another way, the amplitude spectrum displays the signal's frequency 
content. 





Construct the one-sided and two-sided spectrum of v(t) = 








Periodic Signals and Average Power 


Sinusoids and phasors are members of the general class of periodic signals. These 
signals obey the relationship 


v(t + mTp) = v(t) -0 « t € oo [8] 


where m is.any integer. This equation simply says that shifting the signal by an integer 
number of periods to the left or right leaves the waveform unchanged. Consequently, a 
periodic signal is fully described by specifying its behavior over any one period. 

The frequency-domain representation of a periodic signal is a line spectrum 
obtained by Fourier series expansion. The expansion requires that the signal have 
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finite average power. Because average power and other time averages are important 
signal properties, we'll formalize these concepts here. 
Given any time function v(?), its average value over all time is defined as 


" j (72 
(v(t)) = lim = | v(t) dt [9] 
Too T -7/2 

The notation (v(t)) represents the averaging operation on the right-hand side, which 
comprises three steps: integrate v(t) to get the net area under the curve from — 7/2 = 
t < T/2; divide that area by the duration T of the time interval; then let T — oo to 
encompass all time. In the case of a periodic signal, Eq. (9) reduces to the average 
over any interval of duration Tọ. Thus 


(u(t)) = T |. wt dt — z| v(t)dt — [10] 


t 0 ^n 


where the shorthand symbol fy stands for an integration from any time 1, to t; + To. 

If v(t) happens to be the voltage across a resistance R, it produces the current 
i(t) = v(t/R and we could compute the resulting average power by averaging the 
instantaneous power v(f)(f) = v^(D/R = Rf(D. But we don't necessarily know 
whether a given signal is a voltage or current, so let's normalize power by assuming 
henceforth that R = 1 Q. Our definition of the average power associated with an 
arbitrary periodic signal then becomes 


PS (uo) =x | Mop a m 


where we have written |v(2)? instead of v?(?) to allow for the possibility of complex 
signal models. In any case, the value of P will be real and nonnegative. 

When the integral in Eq. (11) exists and yields 0 < P < oo, the signal v(t) is 
said to have well-defined average power, and will be called a periodic power signal. 
Almost all periodic signals of practical interest fall in this category. The average 
value of a power signal may be positive, negative, or zero. 

Some signal averages can be found by inspection, using the physical interpreta- 
tion of averaging. As a specific example take the sinusoid 


v(t) = A cos (œt + $) 
which has 


(v(t))=0 P-— [12] 


You should have no trouble confirming these results if you sketch one period of v(t) 
and |v(2)|?. 
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Fourier Series 


The signal w(t) back in Fig. 2.1-3 was generated by summing a dc term and two 
sinusoids. Now we'll go the other way and decompose periodic signals into sums of 
sinusoids or, equivalently, rotating phasors. We invoke the exponential Fourier 
series for this purpose. 


Let v(t) be a power signal with period Tọ = 1/f;. Its exponential Fourier series 
expansion is 


v(t) — pi Cpe n. neos [13] 


n--oo 
The series coefficients are related to u(t) by 


1 oar 
Cs Ty. | v(t)e 7? "ft dt [14] 
0 Jn 
so c, equals the average of the product v(t)e 7?""^*'. Since the coefficients are com- 
plex quantities in general, they can be expressed in the polar form 


C, = eal eÍ ES 6 


where arg c, stands for the angle of c,. Equation (13) thus expands a periodic power 
signal as an infinite sum of phasors, the nth term being 


c, e not = DA ef us Cag J2mnfot 


The series convergence properties will be discussed after considering its spectral 
implications. 

Observe that v(f) in Eq. (13) consists of phasors with amplitude |c,| and angle 
arg c, at the frequencies nf = 0, =f, —2fy . . . . Hence, the corresponding 
frequency-domain picture is a two-sided line spectrum defined by the series coeffi- 
cients. We emphasize the spectral interpretation by writing 


c(nfo) È c, 


so that |c(nfo)| represents the amplitude spectrum as a function of f, and arg c(nfp) 


represents the phase spectrum. Three important spectral properties of periodic 
power signals are listed below. 


1. All frequencies are integer multiples or harmonics of the fundamental fre- 
quency f = 1/T,. Thus the spectral lines have uniform spacing fo. 


2. The dc component equals the average value of the signal, since setting n = 0 in 
Eq. (14) yields 


c(0) = zi u(t) dt = (v(t)) [15] 


To 
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Calculated values of c(0) may be checked by inspecting u(t)—a wise practice 
when the integration gives an ambiguous result. 


3. Ifw(7 is a real (noncomplex) function of time, then 
Cap = cf = |c,| e ? "$9 [16a] 
which follows from Eq. (14) with n replaced by —n. Hence 
Ie(7nf9)| = le(nfo)| arg (nf) = ~arg c(nfo) [165] 


which means that the amplitude spectrum has even symmetry and the phase 
spectrum has odd symmetry. 


When dealing with rea! signals, the property in Eq. (16) allows us to regroup the 


exponential series into complex-conjugate pairs, except for cy. Equation (13) then 
becomes 


oo 
v(t) = co + v, |2c,| cos (2mnfo t + arg c,) [17] 
n-i 
which is the trigonometric Fourier series and suggests a one-sided spectrum. Most 
of the time, however, we'll use the exponential series and two-sided spectra. 
One final comment should be made before taking up an example. The integra- 
tion for c, often involves a phasor average in the form 


T/2 1 
= e Tft dt = ——— (eT — ge infT [18] 
le J2mf rl ) 
EE. sin Tf T 
"fT 


Since this expression occurs time and again in spectral analysis, we'll now introduce 
the sinc function defined by 


in TÀ 
sinc & SATS [19] 
TÀ 
where A represents the independent variable. Some authors use the related sampling 
function defined as Sa (x) & (sin x)/x so that sinc A = Sa (m). Figure 2.1-6 shows 


that sinc A is an even function of A having its peak at A = 0 and zero crossings at all 
other integer values of A, so 


: n RE 
sinc A = 
0 A= +1, #2, ... 


Numerical values of sinc A and sinc? A are given in Table T.4 at the back of the book, 
while Table T.3 includes several mathematical relations that you’ll find helpful for 
Fourier analysis. 
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sinc A 








Figure 2.1-6 The function sinc A = {sin wA)/7A. 





Rectangular Pulse Train EXAMPLE 2.1-1 


Consider the periodic train of rectangular pulses in Fig. 2.1—7. Each pulse has height 
or amplitude A and width or duration r. There are stepwise discontinuities at each 
pulse-edge location t = +7/2, and so on, so the values of v(t) are undefined at these 
points of discontinuity. This brings out another possible difference between a phys- 
ical signal and its mathematical model, for a physical signal never makes a perfect 
stepwise transition. However, the model may still be reasonable if the actual transi- 
tion times are quite small compared to the pulse duration. 

To calculate the Fourier coefficients, we'll take the range of integration in 
Eq. (14) over the central period —T)/2 = t = Ty/2, where 


= r |t| < 7/2 


0 |t > 7/2 
Thus 
To/2 1 7/2 
6, mL | v(te I dt = — | Aen dt 
0 4-1/2 To -1/2 
pa A (e —janfort _ ener) 
—j2mnfo To 


A sinmnfoT 


To Tf, 
Multiplying and dividing by 7 finally gives 


Toa 
Ca = — sinc nfo 7 [20] 
To 


which follows from Eq. (19) with A = nfo 7. 
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Figure 2.1-7 Rectangular pulse train. 
The amplitude spectrum obtained from |c(nfo)| = le, = Af tlsinc nf 7| is 


shown in Fig. 2.1-8a for the case of T/Ty = fy T = 1/4. We construct this plot by 
drawing the continuous function Af, r|sinc fr| as a dashed curve, which becomes the 
envelope of the lines. The spectral lines at —4fy, +8f, and so on, are “missing” 
since they fall precisely at multiples of 1/r where the envelope equals zero. The dc 
component has amplitude c(0) = AT/Ty which should be recognized as the average 
value of u(t) by inspection of Fig. 2.1-7. Incidentally, 7/T, equals the ratio of “on” 
time to period, frequently designated as the duty cycle in pulse electronics work. 

The phase spectrum in Fig. 2.1-8b is obtained by observing that c, is always 
real but sometimes negative. Hence, arg c(nfo) takes on the values 0? and «180^, 
depending on the sign of sinc nf; T. Both +180° and —180? were used here to bring 
out the odd symmetry of the phase. 

Having decomposed the pulse train into its frequency components, let's build it 
back up again. For that purpose, we'll write out the trigonometric series in Eq. (17), 


Ic(nfo)l 


A for 
N 


A fgrlsinc frl 





-f 
1 -fp °fo2f 1 2 3 4 
yee LL T T 
lc) 
arg [c( fo)] 





(b) 


Figure 2.1-8 Spectrum of rectangular pulse train with fr = 1/4. [a] Amplitude; [b) phase. 
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still taking 7/Tg = fy T = 1/4 so cy = A/4 and [2c,| = (24/4) |sinc n/4| = 
(2A/qn)|sin vn/4|. Thus 


A V2A A 
dO Eire m COS Wy t  — cos 2 t + 








cos Bwt t cc 


Summing terms through the third harmonic gives the approximation of u(t) sketched 
in Fig. 2.1-9a. This approximation contains the gross features of the pulse train but 
lacks sharp corners. A more accurate approximation shown in Fig. 2.1-9b comprises 
all components through the seventh harmonic. Note that the small-amplitude higher 
harmonics serve primarily to square up the corners. Also note that the series is con- 
verging toward the midpoint value A/2 at t = +7/2 where v(t) has discontinuities. 





Sketch the amplitude spectrum of a rectangular pulse train for each of the following EXERCISE 2.1-2 
cases: T = To/5, T = Ty2, T = Tp. In the last case the pulse train degenerates into a 
constant for all time; how does this show up in the spectrum? 





Convergence Conditions and Gibbs Phenomenon 


We've seen that a periodic signal can be approximated with a finite number of terms 
of its Fourier series. But does the infinite series converge to u(t)? The study of con- 
vergence involves subtle mathematical considerations that we'll not go into here. 
Instead, we'll state without proof some of tbe important results. Further details are 
given by Ziemer, Tranter and Fannin (1998) or Stark, Tuteur and Anderson (1988). 

The Dirichlet conditions for Fourier series expansion are as follows: If a peri- 
odic function v(f) has a finite number of maxima, minima, and discontinuities per 
period, and if u(t) is absolutely integrable, so that u(t) has a finite area per period, 
then the Fourier series exists and converges uniformly wherever u(t) is continuous. 
These conditions are sufficient but not strictly necessary. 

An alternative condition is that u(t) be square integrable, so that |v(2? has 
finite area per period—equivalent to a power signal. Under this condition, the series 
converges in the mean such that if 


N 
wnt) = Y, cere 
n=—-N 


then 


Jim | |u(t) — vy(t)|? dt = 0 


To 


In other words, the mean square difference between v(t) and the partial sum v,(d) 
vanishes as more terms are included. 
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Figure 2.1-9 Fourier-series reconstruction of a rectangular pulse train. 
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Regardless of whether v(f) is absolutely integrable or square integrable, the 
series exhibits a behavior known as Gibbs phenomenon at points of discontinuity. 
Figure 2.1—10 illustrates this behavior for a stepwise discontinuity at t = tọ. The partial 
sum uy(ft) converges to the midpoint at the discontinuity, which seems quite reason- 
able. However, on each side of the discontinuity, vy(f) has oscillatory overshoot with 
period T5/2N and peak value of about 9 percent of the step height, independent of N. 
Thus, as N — oo, the oscillations collapse into nonvanishing spikes called “Gibbs 
ears” above and below the discontinuity as shown in Fig. 2.1-9c. Kamen and Heck 
(1997, Chap. 4) provide Matlab examples to further illustrate Gibbs phenomenon. 

Since a real signal must be continuous, Gibbs phenomenon does not occur and 
we're justified in treating the Fourier series as being identical to u(t). But idealized 
signal models like the rectangular pulse train often do have discontinuities. You 
therefore need to pay attention to convergence when working with such models. 

Gibbs phenomenon also has implications for the shapes of the filters used with 
real signals. An ideal filter that is shaped like a rectangular pulse will result in dis- 
continuities in the spectrum that will lead to distortions in the time signal. Another 
way to view this is that multiplying a signal in the frequency domain by a rectangu- 
lar filter results in the-time signal being convolved with a sinc function. Therefore, 
real applications use other window shapes with better time-frequency characteris- 
tics, such as Hamming or Hanning windows. See Oppenheim, Schafer and Buck 
(1999) for a more complete discussion on the effects of window shape. 


Parseval’s Power Theorem 


Parseval’s theorem relates the average power P of a periodic signal to its Fourier 
coefficients. To derive the theorem, we start with 


P= z | \v(t)|? dt = E | v(t)u*(t) dt 


T, 0 ^n 





Figure 2.1-10 Gibbs phenomenon at a step discontinuity. 
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Now replace u*(f) by its exponential series 


oo * eo 
v*(t) NE | = ePi ae N c* e Tht 


n--—co n-—-oco 


So that 


1 co 
p= z| zo Y ote Pen | dt 
To J, 


n--co 
e ud 
= 5 Iz | p(r)e imm ae 
n--oo To To 
and the integral inside the sum equals c,. Thus 


oo 


P= X až= V ja’ [21] 


n--oco n---oo 


which is Parseval's theorem. 
The spectral interpretation of this result is extraordinarily simple: 





Observe that Eq. (21) does not involve the phase spectrum, underscoring our prior 
comment about the dominant role of the amplitude spectrum relative to a signal's 
frequency content. For further interpretation of Eq. (21) recall that the exponential 
Fourier series expands v(® as a sum of phasors of the form c, e17", You can easily 
show that the average power of each phasor 1s 


(lc, e? 7" = Jel? [22] 


Therefore, Parseval's theorem implies superposition of average power, since the 
total average power of u(f) is the sum of the average powers of its phasor components. 

Several other theorems pertaining to Fourier series could be stated here. How- 
ever, they are more conveniently treated as special cases of Fourier transform theo- 
rems covered in Sect. 2.3. Table T.2 lists some of the results, along with the Fourier 
coefficients for various periodic waveforms encountered in communication systems. 





EXERCISE 2.1-3 
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2.2 Fourier Transforms and Continuous Spectra 


33 





2.2 FOURIER TRANSFORMS AND 
CONTINUOUS SPECTRA 


Now let's turn from periodic signals that last forever (in theory) to nonperiodic sig- 
nals concentrated over relatively short-time durations. If a nonperiodic signal has 


finite total energy, its frequency-domain representation will be a continuous spec- 
trum obtained from the Fourier transform. 


Fourier Transforms 


Figure 2.2-1 shows two typical nonperiodic signals. The single rectangular pulse 
(Fig. 2.2-1a) is strictly timelimited since v(?) is identically zero outside the pulse 
duration. The other signal is asymptotically timelimited in the sense that v(t) — 0 
as t > X oo. Such signals may also be described loosely as "pulses." In either case, 
if you attempt to average v(?) or |v(D[? over all time you'll find that these averages 
equal zero. Consequently, instead of talking about average power, a more meaning- 
ful property of a nonperiodic signal is its energy. l 

If v(t) is the voltage across a resistance, the total delivered energy would be 
found by integrating the instantaneous power v?(t)/R. We therefore define normal- 
ized signal energy as 


co 
E 


E jule) |? at [1] 


Too 


v(t) 
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ib) 
Figure 2.2-1 
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Some energy calculations can be done by inspection, since E is just the total area 
under the curve of |v(#)|*. For instance, the energy of a rectangular pulse with ampli- 
tude A is simply E = A?r. 

When the integral in Eq. (1) exists and yields 0 < E < co, the signal v(f) is said 
to have well-defined energy and is called a nonperiodic energy signal. Almost all 
timelimited signals of practical interest fall in this category, which is the essential 
condition of spectral analysis using the Fourier transform. 

To introduce the Fourier transform, we'll start with the Fourier series represen- 
tation of a periodic power signal 


oo 


Y c(nfg)e ?7nhst [2] 


n---oo 


$ [2 | nema 
Ti 


n--—oo 0 T. 


li 


v(t) 


ll 


where the integral expression for c(nfo) has been written out in full. According to the 
Fourier integral theorem there's a similar representation for a nonperiodic energy 
signal that may be viewed as a limiting form of the Fourier series of a signal as the 
period goes to infinity. Example 2.1—1 showed that the spectral components of a 
pulse train are spaced at intervals of nf; = n/T, so they become closer together as 
the period of the pulse train increased. However, the shape of the spectrum remains 
unchanged if the pulse width 7 stays constant. Let the frequency spacing f; = Tg’ 
approach zero (represented in Eq. 3 as df) and the index n approach infinity such 
that the product nf, approaches a continuous frequency variable f. Then 


v(t) = | E | | T y(the nh ater df [3] 


—00 -00 


The bracketed term is the Fourier transform of u(t) symbolized by V(f) or &[v(2] 
and defined as | 


Wf) = &[vt)] à | v(t)e inh dt [4] 
an integration over all time that yields a function of the continuous variable f. 
The time function v(£) is recovered from V( f) by the inverse Fourier transform 


A 


u(t) = $^ tV(£)] | Viper dr is 


-—O00 
an integration over all frequency f. To be more precise, it should be stated that 
9 [V(f)] converges in the mean to v(t), similar to Fourier series convergence, 
with Gibbs phenomenon occurring at discontinuities. But we'll regard Eq. (5) as 
being an equality for most purposes. A proof that #~![V(f)] = v(t) will be outlined 
in Sect. 2.5. 


2.2 Fourier Transforms and Continuous Spectra 


Equations (4) and (5) constitute the pair of Fourier integrals'. At first glance, 
these integrals seem to be a closed circle of operations. In a given problem, however, 
you usually know either V(f) or v(t). If you know V(f), you can find v(t) from 
Eq. (5); if you know v(2), you can find V(f) from Eq. (4). 

Turning to the frequency-domain picture, a comparison of Eqs. (2) and (5) indi- 
cates that V( f) plays the same role for nonperiodic signals that c(nfg) plays for peri- 
odic signals. Thus, V(f) is the spectrum of the nonperiodic signal u(t). But V(f) is 
a continuous function defined for all values of f whereas c(nfo) is defined only for 
discrete frequencies. Therefore, a nonperiodic signal will have a continuous spec- 
trum rather than a line spectrum. Again, comparing Eqs. (2) and (5) helps explain 
this difference: in the periodic case we return to the time domain by summing 
discrete-frequency phasors, while in the nonperiodic case we integrate a continuous 
frequency function. Three major properties of V(f) are listed below. 


1. The Fourier transform is a complex function, so |V(f)| is the amplitude spec- 
trum of u(t) and arg V(f) is the phase spectrum. 


2. The value of V(f) at f = 0 equals the net area of v(t), since 


v(0) = | v(t) dt i6] 


which compares with the periodic case where c(0) equals the average value of 
v(t). 
3. Ifv(ð is real, then 
MSc a [7 al 
and 
Wis WP gf) = are Vy) [75] 


so again we have even amplitude symmetry and odd phase symmetry. The term 
hermitian symmetry describes complex functions that obey Eq. (7). 





Rectangular Pulse 
In the last section we found the line spectrum of a rectangular pulse train. Now con- 


sider the single rectangular pulse in Fig. 2.2-1a. This is so common a signal model 
that it deserves a symbol of its own. Let's adopt the pictorial notation 


Tir) PS * \t| < 7/2 


0 \t| > 7/2 a 


1 Other definitions take o for the frequency variable and therefore include 1/27 or 1/ V7 as multi- 
plying terms. 
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which stands for a rectangular function with unit amplitude and duration 7 centered 
at ¢ = 0. The pulse in the figure is then written 


v(t) = AII(r/7) [9a] 
Inserting v(t) in Eq. (4) yields 
7/2 l A 
Vif) = | Ae /?*F dt = — sin mfr [95] 
-1/2 vf 
— AT sinc fr 


so V(Q) = Ar, which clearly equals the pulse's area. The corresponding spectrum, 
plotted in Fig. 2.2-2, should be compared with Fig. 2.1-8 to illustrate the similari- 
ties and differences between line spectra and continuous spectra. 

Further inspection of Fig. 2.2—2 reveals that the significant portion of the spec- 
trum is in the range |f] < 1/7 since |V(f)| << |V(0)| for |f| > 1/7. We therefore may 
take 1/r as a measure of the spectral “width.” Now if the pulse duration is reduced 
(small 7), the frequency width is increased, whereas increasing the duration reduces 
the spectral width. Thus, short pulses have broad spectra, and long pulses have nar- 
row spectra. This phenomenon, called reciprocal spreading, is a general property 
of all signals, pulses or not, because high-frequency components are demanded by 
rapid time variations while smoother and slower time variations require relatively 
little high-frequency content. 


IVAI 


AT 





-l/r 0 Vr 2/7 3/r Alv 


arg Wf) 





Figure 2.2-2 Rectangular pulse spectrum V|f) = Az sinc fr. 


2.2 Fourier Transforms and Continuous Spectra 


Symmetric and Causal Signals 


When a signal possesses symmetry with respect to the time axis, its transform inte- 
gral can be simplified. Of course any signal symmetry depends upon both the wave- 
shape and the location of the time origin. But we're usually free to choose the time 
origin since it's not physically unique—as contrasted with the frequency-domain 
origin which has a definite physical meaning. 

To develop the time-symmetry properties, we'll write œw in place of 27f for 
notational convenience and expand Eq. (4) using e^?" = cos wt — j sin wt. Thus, in 


general 
Wf) = Vf) + VS) [10a] 

where 
Vf) = | v(t) cos wt dt [105] 


Vif) 2 -[ v(t) sin wt dt 


—oo 


which are the even and odd parts of V(f), regardless of v(t). Incidentally, note that 
if v(f) is real, then 


Re[V(£f)] = V(£ | Im[V(£)] = Vf) 
so V*(f) = Vf) —JV4(f) ^ V(—f), as previously asserted in Eq. (7). 


When v(t) has time symmetry, we simplify the integrals in Eq. (100) by apply- 
ing the general relationship 


| w(t) dt — [wo dt + f w(t) dt [11] 


=œ 0 —o0 
2 w(t) dt w(t) even 
=4 


0 w(t) odd 
where w(t) stands for either u(t) cos wt or Uf) sin wt. If u(t) has even symmetry so that 
v(—t) = v(t) [12a] 
then v(t) cos wt is even whereas v(t) sin wt is odd. Hence, V,(f) = 0 and 


Wf) VF) = 2| wo) COS wt dt [12b] 
0 


Conversely, if v(f) has odd symmetry so that 


v(—-t) = —v(t) [13a] 
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then 


oo 


Vif) 9 JW (f) = -| u(t) sin wt dt [135] 
0 
and V,(f) = 0. 

Equations (12) and (13) further show that the spectrum of a real symmetrical 
signal will be either purely real and even or purely imaginary and odd. For instance, 
the rectangular pulse in Example 2.2—1 is a real and even time function and its spec- 
trum was found to be a real and even frequency function. 

Now consider the case of a causal signal, defined by the property that 


u(t) = 0 pep [14a] 


This simply means that the signal “starts” at or after £ = 0. Since causality precludes 


any time symmetry, the spectrum consists of both real and imaginary parts com- 
puted from 


Vf) = | v(t)e If dt [145] 
0 


This integral bears a resemblance to the Laplace transform commonly used for the 
study of transients in linear circuits and systems. Therefore, we should briefly con- 
sider the similarities and differences between these two types of transforms. 

The unilateral or one-sided Laplace transform is a function of the complex vari- 
able s = c + jw defined by 


S[w(t)] £ | v(t)e^* dt 
0 

which implies that u(t) = 0 for t < 0. Comparing £[v(t)] with Eq. (145) shows that 
if v(t) is a causal energy signal, you can get V( f) from the Laplace transform by let- 
ting s = j2mf. But a typical table of Laplace transforms includes many nonenergy 
signals whose Laplace transforms exist only with o > 0 so that |v(2)e7?| = [vde] ^ 
0 as t — oo. Such signals do not have a Fourier transform because s = o + jo falls 
outside the frequency domain when o # 0. On the other hand, the Fourier transform 
exists for noncausal energy signals that do not have a Laplace transform. See Kamen 
and Heck (1997, Chap. 7) for further discussion. 





EXAMPLE 2.2-2 


Causal Exponential Pulse 
Figure 2.2-3a shows a causal waveform that decays exponentially with time con- 
stant 1/b, so 


v(t) = 


Ae "5 t>0 
[15al 


0 t«0 
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The spectrum can be obtained from Eq. (145) or from the Laplace transform £[v(1)] = 
Al(s + b), with the result that 


A 


MT e gae 


[155] 


which is a complex function in unrationalized form. Multiplying numerator and 
denominator of Eq. (15b) by b — j2af yields the rationalized expression 


b — j2rf 
VW => 
(f) b? + (2f)? 
v(t) 
A 
t 
0 l/b 
(a) 
IVF I 











(b) 


Figure 2.2-3 Causal exponential pulse. (a) Waveform; (b) spectrum. 
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and we see that 











cidem 
Vf) = Re [Vif] U p? T (2af)? 
E "T 2TfÁ 
Keds Udee c o 
Conversion to polar form then gives the amplitude and phase spectrum 
A 
VG = VVXf) + VAF) = 
arg V(f) = arctan "n = -arctan 277 


which are plotted in Fig. 2.2~3b. 

The phase spectrum in this case is a smooth curve that includes all angles from 
—90° to +90°. This is due to the signal’s lack of time symmetry. But V(f) still has 
hermitian symmetry since v(t) is a real function. Also note that the spectral width is 
proportional to b, whereas the time “width” is proportional to the time constant 
1/b—another illustration of reciprocal spreading. 


Find and sketch V(f) for the symmetrical decaying exponential v(t) = Ae"! in 


Fig. 2.2-1b. (You must use a definite integral from Table T.3.) Compare your result 
with V.(f) in Example 2.2-2. Confirm the reciprocal-spreading effect by calculating 


the frequency range such that |V(f)| = (1/2)|W(0)]. 


Rayleigh's Energy Theorem 


Rayleigh's energy theorem is analogous to Parseval's power theorem. It states that 
the energy E of a signal v(?) is related to the spectrum V( f) by 


e= vovnar- | mare ne 


—oo 


Therefore, 





2.2 Fourier Transforms and Continuous Spectra 


The value of Eq. (16) lies not so much in computing £, since the time-domain 
integration of |v(2)? often is easier. Rather, it implies that |V(f)|* gives the distribu- 
tion of energy in the frequency domain, and therefore may be termed the energy 
spectral density. By this we mean that the energy in any differential frequency band 
df equals |V(f)|? df, an interpretation we'll further justify in Sect. 3.6. That interpre- 
tation, in turn, lends quantitative support to the notion of spectral width in the sense 
that most of the energy of a given signal should be contained in the range of fre- 
quencies taken to be the spectral width. 

By way of illustration, Fig. 2.2—4 is the energy spectral density of a rectangular 
pulse, whose spectral width was claimed to be |f| < 1/r. The energy in that band is 
the shaded area in the figure, namely 


l/r l/r 
| IW V)? af = | (Av)? sinc? fr df = 0.92.A?r 
—1/r -l 
a calculation that requires numerical methods. But the total pulse energy is E ^ A?z, 
so the asserted spectral width encompasses more than 90 percent of the total energy. 


Rayleigh's theorem is actually a special case of the more general integral rela- 
tionship 


| v(t)w*(t) dt = | V(f)W*(f) df [17] 
where v(t) and w(t) are arbitrary energy signals with transforms V(f) and W(f). 
Equation (17) yields Eq. (16) if you let w(t) = v(t) and note that f*9,, v(ru*(f) dt = 
E. Other applications of Eq. (17) will emerge subsequently. 
The proof of Eq. (17) follows the same lines as our derivation of Parseval’s 
theorem. We substitute for w*(r) the inverse transform 


wt) =| | “Wem 4| = | OP) d 


IV FP 





-3i -2h -Mr 9% ir Ar 3i 


Figure 2.2-4 Energy spectral density of a rectangular pulse. 
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Interchanging the order of time and frequency integrations then gives 


| v(t)w*(t) dt = | TO i W*(f)e = a dt 


—Ooo0 700 —oo 


- I vom dena 
-—O00 00 
which completes the proof since the bracketed term equals V( f). 

The interchange of integral operations illustrated here is a valuable technique in 
signal analysis, leading to many useful results. However, you should not apply the tech- 
nique willy-nilly without giving some thought to the validity of the interchange. As a 
pragmatic guideline, you can assume that the interchange is valid if the results make 
sense. If in doubt, test the results with some simple cases having known answers. 





EXERCISE 2.2-2 


Calculate the energy of a causal exponential pulse by applying Rayleigh's theorem 
to V(f) in Eq. (15b). Then check the result by integrating |v(2)/?. 





Duality Theorem 


If you reexamine the pair of Fourier integrals, you'll see that they differ only by the 
variable of integration and the sign in the exponent. A fascinating consequence of 
this similarity is the duality theorem. The theorem states that if v( and V( f) con- 
stitute a known transform pair, and if there exists a time function z(f) related to the 
function V(f) by 


z(t) = V(t) [18a] 
then 


#[z(t)] = v(-f) [185] 
where v(—f) equals v(t) with t = —f. 
Proving the duality theorem hinges upon recognizing that Fourier transforms 
are definite integrals whose variables of integration are dummy variables. Therefore, 
we may replace f in Eq. (5) with the dummy variable A and write 


oo 
v(t) — | V(A)e?7* dÀ 
Furthermore, since t is a dummy variable in Eq. (4) and since z(t) = V(t) in the 
theorem, 


Flz(t)] = | z(A)e f^ da = | V(A)e/?74C da 


—o0 —CO 


Comparing these integrals then confirms that #{z(t)] = v(—f). 


2.2 Fourier Transforms and Continuous Spectra 


Although the statement of duality in Eq. (18) seems somewhat abstract, it turns 
out to be a handy way of generating new transform pairs without the labor of inte- 
gration. The theorem works best when v(f) is real and even so z(f) will also be real 


and even, and Z(f) = #[z(t)] ^ v(—f) = v(f). The following example should clar- 
ify the procedure. 





Sinc Pulse 
A rather strange but important time function in communication theory is the sinc 
pulse plotted in Fig. 2.2—5a and defined by 


z(t) = A sinc 2Wt [19a] 
We'll obtain Z(f) by applying duality to the transform pair 
v(t) = BII(t/7) Vif) = Br sinc fr 
Rewriting Eq. (19a) as 
A 
te (2 inc t(2 
z(t) (4) W) sinc t(2W) 


brings out the fact that z(t) = V(t) with 7 = 2W and B = A/2W. Duality then says 
that FO] = v(—f) = BI- fiT) = (A/2WAI(—f/2W) or 


A f 
ns) = &n( 4) m 
since the rectangle function has even symmetry. 

The plot of Z(f), given in Fig. 2.2—5b, shows that the spectrum of a sinc pulse 
equals zero for |f| > W. Thus, the spectrum has clearly defined width W, measured 
in terms of positive frequency, and we say that Z(f) is bandlimited. Note, however, 
that the signal z(t) goes on forever and is only asymptotically timelimited. 





Find the transform of z(t) = B/[1 + (277t)?] by applying duality to the result of Exer- 


cise 2.2—1. 





z(t ZCf) 
a^ AINW 
t - f 
-1/2W MT -W 9 W 
la) (b) 


Figure 2.2-5 À sinc pulse and its bandlimited spectrum. 
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Transform Calculations 


Except in the case of a very simple waveform, brute-force integration should be 
viewed as the method of last resort for transform calculations. Other, more practical 
methods are discussed here. 

When the signal in question is defined mathematically, you should first consult 
a table of Fourier transforms to see if the calculation has been done before. Both 
columns of the table may be useful, in view of the duality theorem. A table of 
Laplace transforms also has some value, as mentioned in conjunction with Eq. (14). 

Besides duality, there are several additional transform theorems covered in 
Sect. 2.3. These theorems often help you decompose a complicated waveform into 
simpler parts whose transforms are known. Along this same line, you may find it 
expedient to approximate a waveform in terms of idealized signal models. Suppose 
Z(t) approximates z(r) and magnitude-squared error \z(t) — Z(OP is a small quantity. If 
Z(f) = FRA] and Z(f) = FIZA] then 


[io -xoorar- | let -oea 0 


which follows from Rayleigh’s theorem with u(t) = z(t) — z(t). Thus, the integrated 
approximation error has the same value in the time and frequency domains. 

The above methods are easily modified for the calculation of Fourier series 
coefficients. Specifically, let v(t) be a periodic signal and let z(t) = v()II(t/To), a 
nonperiodic signal consisting of one period of v(?). If you can obtain 


Z(f) = FIVA T1(t/To)] [21a] 
then, from Eq. (14), Sect. 2.1, the coefficients of v(f) are given by 


C, = GR [21b] 
To 
This relationship facilitates the application of transform theorems to Fourier series 
calculations. 

Finally, if the signal is defined in numerical form, its transform can be found via 
numerical calculations. For this purpose, the FFT computer algorithm is especially 
well suited. For details on the algorithm and the supporting theory of discrete 
Fourier transforms, see Oppenheim, Schafer and Buck (1999). 





2.3 TIME AND FREQUENCY RELATIONS 


Rayleigh's theorem and the duality theorem in the previous section helped us draw 
useful conclusions about the frequency-domain representation of energy signals. 
Now we'll look at some of the many other theorems associated with Fourier trans- 
forms. They are included not just as manipulation exercises but for two very practical 
reasons. First, the theorems are invaluable when interpreting spectra, for they express 


2.3 Time and Frequency Relations 


relationships between time-domain and frequency-domain operations. Second, we 
can build up an extensive catalog of transform pairs by applying the theorems to 
known pairs—and such a catalog will be useful as we seek new signal models. 

In stating the theorems, we indicate a signal and its transform (or spectrum) by 
lowercase and uppercase letters, as in V(f) = #[v(t)] and v(t) = 9 ![(V(f)]. This is 
also denoted more compactly by u(t) © V(f). Table T.1 at the back lists the theo- 
rems and transform pairs covered here, plus a few others. 


Superposition 


Superposition applies to the Fourier transform in the following sense. If a, and a, 
are constants and 


u(t) = ayy(t) + a;vs(t) 
then 


S [v(r)] = a Flut] + a.F[v2(t)] 


Generalizing to sums with an arbitrary number of terms, we write the superposition 
(or linearity) theorem as 


Ya ut) € > a, Wf) n] 


This theorem simply states that linear combinations in the time domain become lin- 
ear combinations in the frequency domain. 

Although proof of the theorem is trivial, its importance cannot be overempha- 
sized. From a practical viewpoint Eq. (1) greatly facilitates spectral analysis when 
the signal in question is a linear combination of functions whose individual spectra 
are known. From a theoretical viewpoint it underscores the applicability of the 
Fourier transform for the study of linear systems. 


Time Delay and Scale Change 


Given a time function v(ġ, various other waveforms can be generated from it by 
modifying the argument of the function. Specifically, replacing t by t — £t; produces 
the time-delayed signal u(t — tj). The delayed signal has the same shape as u(t) but 
shifted 7; units to the right along the time axis. In the frequency domain, time delay 
causes an added linear phase with slope —27rt,, so that 


v(t — ty) & V(f)e inh [2] 


If t, is a negative quantity, the signal is advanced in time and the added phase has 
positive slope. The amplitude spectrum remains unchanged in either case, since 


Wg eS VP) en pes VY]. 
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Proof of the time-delay theorem is accomplished by making the change of 
variable A = t — t4 in the transform integral. Thus, using w = 2af for compactness, 
we have 


S[v(t — 1)] = [we ~ tye dt 


TOG 


-| v(A)e AtA dA 


= | v(A)e 79^ an [e 


The integral in brackets is just V(f), so V[v(t — tj] = Vif )e ve. 

Another time-axis operation is scale change, which produces a horizontally 
scaled image of v(?) by replacing t with at. The scale signal v(at) will be expanded 
if |a| < 1 or compressed if |a] > 1; a negative value of æ yields time reversal as well 
as expansion or compression. These effects may occur during playback of recorded 
signals, for instance. 

Scale change in the time domain becomes reciprocal scale change in the fre- 
quency domain, since 


1 
v(at) € — v(£) a0 [3] 
le] Ne 
Hence, compressing a signal expands its spectrum, and vice versa. If a = —1, then 


u(—t) €» V(—f) so both the signal and spectrum are reversed. 
We'll prove Eq. (3) for the case a < 0 by writing a = —|a| and making the 
change of variable A = —|a|t. Therefore, t = A/a, dt = —dA|a], and 


II 


S[v(— |alt)] | v(— |o|t)e 7 dt 


—Ooo 


—] 799 : 
ET | v(A)e FeV" dr 
a 
+00 
1 +00 
= Tal | v(A)e —j?u(f/a) qn 


—oo 


“an 


Observe how this proof uses the general relationship 





—b 


Poa) d(—-A) = -| x(A) dA = | 20) dA 


a —a —b 


Hereafter, the intermediate step will be omitted when this type of manipulation occurs. 


2.3 Time and Frequency Relations 





AITI(t/r) such that 
Zq(t) = u(t — ta) + (-Dv[t — (ta + T)] 
Application of the superposition and time-delay theorems yields 
ZU wona epe De eden 
= V £X eirfa — e Pet) 


where V(f) = Ar sinc fr. 

The bracketed term in Z,(f) is a particular case of the expression e/?^i-- e/?9: 
which often turns up in Fourier analysis. A more informative version of this expres- 
sion is obtained by factoring and using Euler's theorem, as follows: 


eli + eh = [ei + $70.02), 0:02 [4] 


E { 2 cos (0; — 0,)e €:*92 
— (J2 sin (8, — 6;)e/6-*82 


The upper result in Eq. (4) corresponds to the upper (+) sign and the lower result to 
the lower (—) sign. 

In the problem at hand we have 0, = —ft,and 0, = —Tf(t; T),so0, — 0, = 
afT and 0, + 0, = —27ftg where tg = t; + T/2 as marked in Fig. 2.3-1a. Therefore, 
after substituting for V(f), we obtain 


Z(f) = (Ar sinc fr)(j2 sin wf T e 77v) 


Note that Z,(0) = 0, agreeing with the fact that z,(£) has zero net area. 
If tj = 0 and T = 7, z,(t) degenerates to the waveform in Fig. 2.3-1b5 where 


dx SEL 





Figure 2.3-1 Signals in Example 2.3-1. 
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The spectrum then becomes 
ZAf) = (Ar sinc fr)(j2 sin af 7) 
= (jomf 7)AT sinc? f 7 


This spectrum is purely imaginary because z,(t) has odd symmetry. 





Let v(t) be a real but otherwise arbitrary energy signal. Show that if 


z(t) = a,v(t) + a;v(—1) [5a] 


EXERCISE 2.3-1 


then 


Z(f) = (a, + a) Vf) + jla — a) Vf) [5b] 


where V,(f) and V, (f) are the real and imaginary parts of V( f). 





Frequency Translation and Modulation 


Besides generating new transform pairs, duality can be used to generate transform 
theorems. In particular, a dual of the time-delay theorem is 


v(te!* <> Vif — f.) w, = rf, [6] 


We designate this as frequency translation or complex modulation, since multiply- 
ing a time function by e/’ causes its spectrum to be translated in frequency by +f,. 

To see the effects of frequency translation, let v(f) have the bandlimited spec- 
trum of Fig. 2.3-2a, where the amplitude and phase are plotted on the same axes 
using solid and broken lines, respectively. Also let f, > W. Inspection of the trans- 
lated spectrum V(f — f.) in Fig. 2.3-2b reveals the following: 


Wf-fo 







VC) 


arg V( f) 


S 





{a} (b) 


Figure 2.3-2 Frequency translation of a bandlimited spectrum. 
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The significant components are concentrated around the frequency f.. 


2. Though V(f) was bandlimited in W, V (f — f) has a spectral width of 
2W. Translation has therefore doubled spectral width. Stated another way, the 
negative-frequency portion of Vif) now appears at positive frequencies. 


3. W(f-—f.)1s not hermitian but does have symmetry with respect to translated ori- 
gin at f = fo. 


These considerations may appear somewhat academic in view of the fact that 
v(t)e/*' is not a real time function and cannot occur as a communication signal. 
However, signals of the form v(ft) cos (w,t + $) are common—in fact, they are the 
basis of carrier modulation—and by direct extension of Eq. (6) we have the follow- 
ing modulation theorem: 


ip -—jé 
v(t) cos (ost + G) oS vf - £) e f v f) [7] 


In words, multiplying a signal by a sinusoid translates its spectrum up and down in 
frequency by f.. All the comments about complex modulation also apply here. In 
addition, the resulting spectrum is hermitian, which it must be if u(t) cos (w,t + $) 
is a real function of time. The theorem is easily proved with the aid of Euler's theo- 
rem and Eq. (6). 





RF Pulse 


Consider the finite-duration sinusoid of Fig. 2.3-3a, sometimes referred to as an RF 
pulse when f, falls in the radio-frequency band. (See Fig.1.1—2 for the range of fre- 
quencies that supports radio waves.) Since 


t 
z(t) = an(£) COS w, t 
we have immediately 
a 
2 


obtained by setting v(t) = AII(z/7) and V(f) = Ar sinc fr in Eq. (7). The resulting 
amplitude spectrum is sketched in Fig. 2.3—3b for the case of f, >> 1/7 so the two 
translated sinc functions have negligible overlap. 

Because this is a sinusoid of finite duration, its spectrum is continuous and con- 
tains more than just the frequencies f = —f.. Those other frequencies stem from the 
fact that z(f) = 0 for |t| > 7/2, and the smaller 7 is, the larger the spectral spread 
around +f —reciprocal spreading, again. On the other hand, had we been dealing 
with a sinusoid of infinite duration, the frequency-domain representation would be a 
two-sided line spectrum containing only the discrete frequencies +f. 


Z(f) = sine (f — £r + 5 sine (f + fr. 
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Figure 2.3-3 (a) RF pulse; (6) amplitude spectrum. 


Differentiation and Integration 


Certain processing techniques involve differentiating or integrating a signal. The 
frequency-domain effects of these operations are indicated in the theorems below. A 
word of caution, however: The theorems should not be applied before checking to 
make sure that the differentiated or integrated signal is Fourier-transformable; the 
fact that u(t) has finite energy is not a guarantee that the same holds true for its 
derivative or integral. 

To derive the differentiation theorem, we replace u(t) by the inverse transform 
integral and interchange the order of operations, as follows: 


d d m jf 
“yo =4 | | Yu» "ar 
oo d os 
[woe 
= [| Lemrvinle ar 


Referring back to the definition of the inverse transform reveals that the bracketed 
term must be F[du(t)/dt], so 


d 
qi esf) 


and by iteration we get 


2.3 Time and Frequency Relations 


n 


ult) e» G2 VU) i 





which is the differentiation theorem. 

Now suppose we generate another function from v(t) by integrating it over all 
past time. We write this operation as fto v(A) dA, where the dummy variable A is 
needed to avoid confusion with the independent variable ¢ in the upper limit. The 
integration theorem says that if 


Ww(0) = | vo) dA =0 [9a] 
then 
: 1 
| va) dÀ v af Vf) [9b] 


The zero net area condition in Eq. (9a) ensures that the integrated signal goes to zero 
as t — co. (We'll relax this condition in Sect. 2.5.) 
To interpret these theorems, we see that 








Spectral interpretation thus agrees with the time-domain observation that differenti- 
ation accentuates time variations while integration smoothes them out. 
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Triangular Pulse 
The waveform z(t) in Fig. 2.3-1b has zero net area, and integration produces a tri- 
angular pulse shape. Specifically, let 


w(t) — > | z (X) dA = 4(1 x “) LE 
"RS 0 jt} > 7 


which is sketched in Fig. 2.3-4a. Applying the integration theorem to Z,(f) from 
Example 2.3-1, we obtain 


MI 
T j2mf 


as shown in Fig. 2.3-4b. A comparison of this spectrum with Fig. 2.2-2 reveals that 
the triangular pulse has less high-frequency content than a rectangular pulse with 


Wf) = Z,(f) = Av sinc? fr 
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Figure 2.3-4 A triangular pulse and its spectrum. 


amplitude A and duration 7, although they both have area Ar. The difference is 
traced to the fact that the triangular pulse is spread over 27 seconds and does not have 
the sharp, stepwise time variations of the rectangular shape. 

This transform pair can be written more compactly by defining the triangular 


function 
Ir 
i <r 
A(£) 8 T [10] 
7 0 Il] > 7 
Then w(t) = AA(t/T) and 
aa() © Ar sinc? fr n 


It so happens that triangular functions can be generated from rectangular functions 
by another mathematical operation, namely, convolution. And convolution happens 
to be the next item on our agenda. 





EXERCISE 2.3-2 A dual of the differentiation theorem is 


d" 





t"u(t) € Cymy an Wf) [12] 


Derive this relationship for n = 1 by differentiating the transform integral V*[v(7)] 
with respect to f. 





2.4 CONVOLUTION 


The mathematical operation known as convolution ranks high among the tools used 
by communication engineers. Its applications include system analysis and probabil- 
ity theory as well as transform calculations. Here we are concerned with convolution 
in the time and frequency domains. 


2.4 Convolution 


Convolution Integral 


The convolution of two functions of the same variable, say v(f) and w(t), is defined by 


ceo 
v * w(t) £ | v(A)w(t — A) dA [1] 
—o00 
The notation v * w(t) merely stands for the operation on the right-hand side of Eq. (1) 
and the asterisk (*) has nothing to do with complex conjugation. Equation (1) is the 
convolution integral, often denoted v * w when the independent variable is unam- 
biguous. At other times the notation [v(2)] * [w(2)] is necessary for clarity. Note care- 
fully that the independent variable here is t, the same as the independent variable of 
the functions being convolved; the integration is always performed with respect to a 
dummy variable (such as A) and t is a constant insofar as the integration is concerned. 
Calculating v * w(t) is no more difficult than ordinary integration when the two 
functions are continuous for all t. Often, however, one or both of the functions is 
defined in a piecewise fashion, and the graphical interpretation of convolution 
becomes especially helpful. 
By way of illustration, take the functions in Fig. 2.4-la where 


v(t) Ae' O<t<oco 
w(t) = t/T Dew 
For the integrand in Eq. (1), v(A) has the same shape as u(t) and 
[SA 
T 





w(t — A) = O<t-A<T 

But obtaining the picture of w(t — A) as a function of A requires two steps: first, we 
reverse w(7) in time and replace t with A to get w(— A); second, we shift w(—A) to the 
right by t units to get w[—(A — £)] = w(t — A) for a given value of t. Figure 2.4-1b 
shows v(A) and w(t — A) with t < 0. The value of t always equals the distance from 
the origin of v(A) to the shifted origin of w(— A) indicated by the dashed line. 

AS U * w(t) is evaluated for —oo < t < oo, w(t — A) slides from left to right with 
respect to v(À), so the convolution integrand changes with t. Specifically, we see in 
Fig. 2.4-15 that the functions don't overlap when t < 0; hence, the integrand equals 
zero and 


u*w(t) 0  r«0 


WhenO «t « Tas in Fig. 2.4—1c, the functions overlap for 0 < A < t, so t becomes 
the upper limit of integration and 


t 
ALSA 
v * w(t) = | ^e (=>) dÀ 
" T 


—-—(r-1-e") O<#<T 
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0 t-T t 
id) 
Figure 2.4-1 Graphical interpretation of convolution. 


Finally, when t > T as in Fig. 2.4—1d, the functions overlap for t - T < A < t and 


Nx x 
vx w(t) = | Ae ii ) dA 
t-T T 


A 
s aa eje uM. uS 





The complete result plotted in Fig. 2.4—2 shows that convolution is a smoothing oper- 
ation in the sense that v * w(f) 1s "smoother" than either of the original functions. 


2.4 Convolution 


A 
=—(T-lt+et 
T i 


A (T-1 + etyett-9) 
T 





0 T 


Figure 2.4-2 Result of the convolution in Fig. 2.4-1. 


Convolution Theorems 


The convolution operation satisfies a number of important and useful properties. 
They can all be derived from the convolution integral in Eq. (1). In some cases they 
are also apparent from graphical analysis. For example, further study of Fig. 2.4-1 
should reveal that you get the same result by reversing v and sliding it past w, so 
convolution is commutative. This property is listed below along with the associa- 
tive and distributive properties. 


U*W--W*U [2a] 
U*(w*z) = (U* Ww) *Z [2b] 
U * (w +z) = (v*w) + (v *z) [2c] 


All of these can be derived from Eq. (1). 


Having defined and examined the convolution operation, we now list the two 
convolution theorems: 


v *w(t) e V(f)W(f) [3] 
v(t)w(t) & V« W(f) [4] 
These theorems state that convolution in the time domain becomes multiplication in 
the frequency domain, while multiplication in the time domain becomes convolution 
in the frequency domain. Both of these relationships are important for future work. 
The proof of Eq. (3) uses the time-delay theorem, as follows: 


| | | [ awe — A) an [e P 


Flv * w(t) | 


= pu |» cede ae dA 


| voo nei an 


—co 
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- [sen a. wu) = MEW) 


—oo 


Equation (4) can be proved by writing out the transform of v(t)w(t) and replacing 
w(t) by the inversion integral 9» ![W(f)]. 


Trapezoidal Pulse 


To illustrate the convolution theorem—and to obtain yet another transform pair— 
let's convolve the rectangular pulses in Fig. 2.4—3a. This is a relatively simple task 
using the graphical interpretation and symmetry considerations. If 7, > 75, the prob- 
lem breaks up into three cases: no overlap, partial overlap, and full overlap. 
Fig. 2.4-3b shows v(A) and w(t — A) in one case where there is no overlap and 
v * w(t) = 0. For this region 


yd e ert 
2 2 

Or 
(7, +T) 
oa 


t< 


There is a corresponding region with no overlap where t — 74/2 > 7,/2, ort > (7, + 
73/2. Combining these together yields the region of no overlap as |t| > (7, + 7)/2. 
In the region where there is partial overlap t + T2/2 > —7,/2 and t — 1/2 < —7,/2, 


. Which yields 





T 


ME T, T T, T Ty —7 
ot w(t) = | Adda = Aaa t+ 1572) a ee 5 Z 


By properties of symmetry the other region of partial overlap can be found to be 


Tj 


2 T, oT Ty Ty bor 
v * w(t) = | z Adad = Ara =e + 18) fu Qu E eS 
ps 


2 


Finally, the convolution in the region of total overlap is 


T3 
tt 3 


v * w(t) = | „n AiAodÀ = AAT It < 


>T 


T| — T2 


2 





The result is the trapezoidal pulse shown in Fig. 2.4—3c, whose transform will be the 
product V(f)W(f) = (A47, sinc f7,) (A57; sinc fro). 

Now let 7; = 7; = T so the trapezoidal shape reduces to the triangular pulse 
back in Fig. 2.3-4a with A = A,A,7. Correspondingly, the spectrum becomes (A,T 
sinc fT) (Agr sinc fT) =Ar sinc? fr, which agrees with our prior result. 
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Figure 2.4-3 Convolution of rectangular pulses. 





Ideal Lowpass Filter EXAMPLE 2.4-2 


In Section 2.1 we mentioned the impact of the discontinuities introduced in a signal 
as a result of filtering with an ideal filter. We will examine this further by taking the 
rectangular function from Example 2.2-1 u(t) = All(t/7) whose transform, V(f) = 
AT sinc fr, exists for all values of f. We can lowpass filter this signal at f = 1/7 by 
multiplying V(f) by the rectangular function 
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z(t) 


7/2 
Figure 2.4-4 


Wf) = n( 5) e 2 sino( 2) 


The output function is 


"$ 2A 2A 
—— sinc —- dA 
T T 


Ae) = vo) + w(t) = w(t) +000) = | 


This integral cannot be evaluated in closed form; however, it can be evaluated 


numerically using Table T.4 to obtain the result shown in Fig. 2.4—4. Note the simi- 
larity to the result in Fig. 2.1—9b. 





EXERCISE 2.4-1 Let u(t) = A sinc 2Wt, whose spectrum is bandlimited in W. Use Eq. (4) with w(t) = 
u(t) to show that the spectrum of v(t) will be bandlimited in 2W. 





2.5 IMPULSES AND TRANSFORMS IN THE LIMIT 


So far we’ve maintained a distinction between two spectral classifications: line 
spectra that represent periodic power signals and continuous spectra that represent 
nonperiodic energy signals. But the distinction poses something of a quandary when 
you encounter a signal consisting of periodic and nonperiodic terms. We'll resolve 
this quandary here by allowing impulses in the frequency domain for the represen- 
tation of discrete frequency components. The underlying notion of transforms in 
the limit also permits the spectral representation of time-domain impulses and other 
signals whose transforms don’t exist in the usual sense. 


Properties of the Unit Impulse 


The unit impulse or Dirac delta function 6(f) is not a function in the strict mathe- 
matical sense. Rather, it belongs to a special class known as generalized functions 


2.5 Impulses and Transforms in the Limit 


or distributions whose definitions are stated by assignment rules. In particular, the 
properties of ó(t) will be derived from the defining relationship 


fee ne {x0 ERLA " 


; 0 otherwise 
1 


where u(t) is any ordinary function that's continuous at t = 0. This rule assigns a 
number—either v(0) or 0—to the expression on the left-hand side. Equation (1) and 
all subsequent expressions will also apply to the frequency-domain impulse ó(f) by 
replacing t with f. 

If v(t) = 1 in Eq. (1), it then follows that 


| e(t) dt = | ó(t) dt = 1 ^ — B 
with e being arbitrarily small. We interpret Eq. (2) by saying that 5(¢) has unit area 
concentrated at the discrete point t = 0 and no net area elsewhere. Carrying this 
argument further suggests that 


é(t) = 0 t#0 [3] 


Equations (2) and (3) are the more familiar definitions of the impulse, and lead to 
the common graphical representation. For instance, the picture of A ó(f — t4) is 
shown in Fig. 2.5-1, where the letter A next to the arrowhead means that A ó(t — t,) 
has area or weight A located at t = t4. 

Although an impulse does not exist physically, there are numerous conventional 
functions that have all the properties of 6(¢) in the limit as some parameter e goes to 
zero. In particular, if the function 6,(4) is such that 


lim | v(t) &,(t) dt = v(0) [4a] 


then we say that 


lim &(r) = à(r) [4b] 


0| i 


Figure 2.5-1 ^ Graphical representation of A8[t — t). 
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Two functions satisfying Eq. (4a) are 


a() = in(i) is 


1 t 
&j(t) = E sine p [6] 


which are plotted in Fig. 2.5-2. You can easily show that Eq. (5) satisfies Eq. (4a) by 
expanding v(t) in a Maclaurin senes prior to integrating. An argument for Eq. (6) 
will be given shortly when we consider impulses and transforms. 

By definition, the impulse has no mathematical or physical meaning unless it 


appears under the operation of integration. Two of the most significant integration 
properties are 


u(t) * &(t — tj) = v(t — ta) [7] 

v(t) S(t — tj) dt = v(t4) [8] 

both of which can derived from Eq. (1). Equation (7) is a replication operation, 

since convolving v(t) with ó(t — t4) reproduces the entire function v(t) delayed by t4. 

In contrast, Eq. (8) is a sampling operation that picks out or samples the value of 
u(t) at t = tj — the point where d(t — ta) is "located." 

Given the stipulation that any impulse expression must eventually be integrated, 


you can use certain nonintegral relations to simplify expressions before integrating. 
Two such relations are 


u(t) S(t — ta) = v(t4) 8(t — ta) [9a] 
S(at) = Tyo a #0 (obj 
ext) |o 








Njim 
nlm 


Figure 2.5-2 Two functions that become impulses as e = O. 
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which are justified by integrating both sides over —co < t < co. The product relation 
in Eq. (9a) simply restates the sampling property. The scale-change relation in 
Eq. (9b) says that, relative to the independent variable t, 5(af) acts like 8(1)/|a|. Set- 
ting a = —1 then brings out the even-symmetry property 0(—1£) = (ð. 





Impulses in Frequency 


Impulses in frequency represent phasors or constants. In particular, let u(t) = A bea 
constant for all time. Although this signal has infinite energy, we can obtain its 
transform in a limiting sense by considering that 


v(t) = lim A sinc 2Wt = A [10a] 
Now we already have the transform pair A sinc 2Wt &3(A/2W)II(f/2W), so 
S [v(t)] = lim E. II (5) =A O6(f) [105] 
w>0 2W 2W 
which follows from Eq. (5) with e = 2W and t = f. Therefore, 
A € A e( f) [11] 


and the spectrum of a constant in the time domain is an impulse in the frequency 
domain at f = 0. 

This result agrees with intuition in that a constant signal has no time variation 
and its spectral content ought to be confined to f = 0. The impulsive form results 
simply because we use integration to return to the time domain, via the inverse 
transform, and an impulse is required to concentrate the nonzero area at a discrete 
point in frequency. Checking this argument mathematically using Eq. (1) gives 


FTA 8(f)] = | A 6( f )e?"f! dt = Ag?" We 


which justifies Eq. (11) for our purposes. Note that the impulse has been integrated 
to obtain a physical quantity, namely the signal v(t) = A. 

As an alternative to the above procedure, we could have begun with a rectangu- 
lar pulse, AII(t/7), and let 7 — co to get a constant for all time. Then, since 
#(AL(t/7)] = Art sinc fr, agreement with Eq. (11) requires that 


lim A7 sinc fr = A O(f) 


EXERCISE 2.5-1 
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And this supports the earlier assertion in Eq. (6) that a sinc function becomes an 
impulse under appropriate limiting conditions. 


To generalize Eq. (11), direct application of the frequency-translation and mod- 
ulation theorems yields 


Ae A Gf — f.) [12] 
Ae Ae i? 
A cos (e.t + d) > AF 7 f£) + SAF +f) n3 
Thus, the spectrum of a single phasor is an impulse at f = f. while the spectrum of a 


sinusoid has two impulses, shown in Fig. 2.5-3. Going even further in this direction, 
if v(t) is an arbitrary periodic signal whose exponential Fourier series is 


v(t) = M c(nf)e?m [14a] 
then its Fourier transform is 
Uf) = 2; c(nfo) 9(.£ — nfo) [145] 


where superposition allows us to transform the sum term by term. 

By now it should be obvious from Eqs. (11)-(14) that any two-sided line spec- 
trum can be converted to a “continuous” spectrum using this rule: convert the spec- 
tral lines to impulses whose weights equal the line heights. The phase portion of the 
line spectrum is absorbed by letting the impulse weights be complex numbers. 
Hence, with the aid of transforms in the limit, we can represent both periodic and 
nonperiodic signals by continuous spectra. That strange beast the impulse function 
thereby emerges as a key to unifying spectral analysis. 

But you may well ask: What's the difference between the line spectrum and the 
"continuous" spectrum of a period signal? Obviously there can be no physical dif- 
ference; the difference lies in the mathematical conventions. To return to the time 
domain from the line spectrum, we sum the phasors which the lines represent. To 
return to the time domain from the continuous spectrum, we integrate the impulses 
to get phasors. 


A 4 -j¢ 4 eit 





Figure 2.5-3 Spectrum of A cos (w.t + Q). 
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The sinusoidal waveform in Fig. 2.5-4a has constant frequency f, except for the EXAMPLE 2.5-1 
interval — 1/f. < t < 1/f, where the frequency jumps to 2f.. Such a signal might be 
produced by the process of frequency modulation, to be discussed in Chap. 5. Our 
interest here is the spectrum, which consists of both impulsive and nonimpulsive 
components. 
For analysis purposes, we'll let 7 = 2/f, and decompose v(t) into a sum of three 
terms as follows: 


v(t) = A cos wt — ATI(t/T) cos w,t  AIl(t/r) cos 2o,t 


The first two terms represent a cosine wave with a “hole” to make room for an RF 
pulse at frequency 2f, represented by the third term. Transforming v(t) term by term 
then yields 

A 
2 


R S [o — A) OF Fz.) 


-f (sinc (fair sinc (f+ fer) 


+ > [sinc (f — 2f,)r + sine (f + 2£)7] 


where we have drawn upon Eq.(13) and the results of Example 2.3-2. The ampli- 
tude spectrum is sketched in Fig. 2.5-4b, omitting the negative-frequency portion. 
Note that |V(/)| is not symmetric about f = f, because the nonimpulsive component 
must include the term at 2f. 





WC Al2 








- (b 


Figure 2.5-4 Waveform and amplitude spectrum in Example 2.5-1. 
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Step and Signum Functions 


We've seen that a constant for all time becomes a dc impulse in the frequency 
domain. Now consider the unit step function in Fig. 2.5—5a which steps from “off” 
to "on" at t = 0 and is defined as 


1 t0 


This function has several uses 1n Fourier theory, especially with regard to causal 
signals since any time function multiplied by u(t) will equal zero for t < 0. However, 
the lack of symmetry creates a problem when we seek the transform in the limit, 
because limiting operations are equivalent to contour integrations and must be 
performed in a symmetrical fashion—as we did in Eq. (10). To get around this prob- 
Jem, we'll start with the signum function (also called the sign function) plotted in 
Fig. 2.5—5b and defined as 


i > 
n ae [16] 


t= 
em 2s] t<0 
which clearly has odd symmetry. 


The signum function is a limited case of the energy signal z(r) in Fig. 2.5-6 
where v(t) = e^"'u(t) and 


rec t>0 


a) = v) - 4-9 = (55 


t «0 
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Figure 2.5~5 (c) Unit step function; {b} signum function. 
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z(t) 





Figure 2.5-6 


so that z(t) — sgn t if b — 0. Combining the results of Example 2.2-2 and Exercise 
2.3-1 yields 


—Jj^mf 
F(z(t)] = Zf) = J2W(f) = PX Qafy 
Therefore, 
; E] 
gj € = = — 
and we have the transform pair 
sgn t<> x [171 


ivf 


We then observe from Fig. 2.5—5 that the step and signum functions are related by 
u(t) = i(sgn t + 1) =4sgnt 1 


Hence, 


1 1 
u(t) V + 7 e( f) [18] 
since F[1/2] = 18&(f). 

Note that the spectrum of the signum function does not include a dc impulse. 
This agrees with the fact that sgn tis an odd function with zero average value when 
averaged over all time, as in Eq. (9), Sect. 2.1. In contrast, the average value of the 
unit step is <u(f)> = 1/2 so its spectrum includes $6(f)—just as the transform of a 
periodic signal with average value c(0) would include the dc term c(0) 6(f). 

An impulsive dc term also appears in the integration theorem when the signal 
being integrated has nonzero net area. We derive this property by convolving u(® 
with an arbitrary energy signal v(t) to get ` 


v * ult) = | voouc — À)dAÀ (19] 
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- i v(A) dÀ 


00 


since u(t — A) = 0 for A > t. But from the convolution theorem and Eq. (18) 


Fv «0 = VA] sa + 2800, 
SO 


| | v(A) dà € —— Wf) + 5 VQ) & f) [20] 


-- 
IMs j2wf 


where we have used V(f) 6(f) = V(0) 5(/). Equation (20) reduces to our previous 
statement of the integration theorem when V(0) = 





EXERCISE 2.5-2 Apply the modulation theorem to obtain the spectrum of the causal sinusoid v(t) — 
Au(t) cos ot. 





Impulses in Time 


Although the time-domain impulse (7) seems a trifle farfetched as a signal model, 
we'll run into meaningful practical applications in subsequent chapters. Equally 
important is the value of 6(f) as an analytic device. To conve its transform, we let 
T — 0 in the known pair 


A t 
= n(+) <> A sinc ft 
T T 

which becomes 


A &(t) A [211 


Hence, the transform of a time impulse has constant amplitude, meaning that its 
spectrum contains all frequencies in equal proportion. 

You may have noticed that A ó(f) +> A is the dual of A €» A 6(/). This dual rela- 
tionship embraces the two extremes of reciprocal spreading in that 





Applying the time-delay theorem to Eq. (21) yields the more general pair 
A S(t — tj) <> Ae "fu [22] 
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It’s a simple matter to confirm the direct transform relationship #[A6(t — t,)] = 
Ae 7?"f«- consistency therefore requires that # ![Ae u] = A &(t — ty), which 
leads to a significant integral expression for the unit impulse. Specifically, since 


co 
Fo" eg Fle] = | e J27F tag rft df 
—oco 


we conclude that 


| eT f0799 df = &(t — ta) [23] 
Thus, the integral on the left side may be evaluated in the limiting form of the unit 
impulse—a result we'll put immediately to work in a proof of the Fourier integral 
theorem. 

Let v(f) be a continuous time function with a well-defined transform V(f) = 
#([v(f)]. Our task is to show that the inverse transform does, indeed, equal v(t). From 
the definitions of the direct and inverse transforms we can write 


FTV f)] = n | [vene an [een df 


= | o» | e ITENS af dn 
But the bracketed integral equals 6(t — A), from Eq. (23), so 
FTV f)] = | v(A) S(t — A) dA = v(t) * S(t) [24] 


Therefore 9^! [V(f)] equals v(t) in the same sense that v(A * S(t) = v(t). A more 
rigorous proof, including Gibbs' phenomenon at points of discontinuity, is given by 
Papoulis (1962, Chap. 2). 

Lastly, we relate the unit impulse to the unit step by means of the integral 


£ 1 t> ti 
O(A — ta) dA = 25 
E ( a) f t< ty i 
= u(t — ty) 


_ Differentiating both sides then yields 


d 
S(t — ta) = E u(t — ta) [26] 


which provides another interpretation of the impulse in terms of the derivative of a 
step discontinuity. 

Equations (26) and (22), coupled with the differentiation theorem, expedite cer- 
tain transform calculations and help us predict a signal's high-frequency spectral 
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rolloff. The method is as follows. Repeatedly differentiate the signal in question 
until one or more stepwise discontinuities first appear. The next derivative, say the 
nth, then includes an impulse A, S(t — t) for each discontinuity of height A, at t = 
thy SO 


d" 
at" 





u(t) = w(t) + >) Ag S(t — t) [27a] 
k 
where w(t) is a nonimpulsive function. Transforming Eq. (27a) gives 
(2nfY Wf) = W(f) + >) A, e Pre [275] 
k 


which can be solved for V(f) if we know W(f) = F [w(1)]. 

Furthermore, if |W(/)| — 0 as f oo, the high-frequency behavior of |V(f)| will 
be proportional to |f|"" and we say that the spectrum has an nth-order rolloff. A 
large value of n thus implies that the signal has very little high-frequency content— 
an important consideration in the design of many communication systems. 





EXAMPLE 2.5-2 


Raised Cosine Pulse 


Figure 2.5—7a shows a waveform called the raised cosine pulse because 


v(t) = ZU + cos 2 \ni( 4) 


We’ll use the differentiation method to find the spectrum V(f) and the high- 
frequency rolloff. The first three derivatives of v(t) are sketched in Fig. 2.5—7b, and 


we see that 
dv(t) BE n *u( t ) 
= si 
dt T/2 T 2T 


which has no discontinuities. However, d?v(tyd£ is discontinuous at t = +7 so 


d? TVA . mt t TVA TVA 
ap o = (z) A sn T n( 4) + (2) 59 "oo (=) i m T) 


This expression has the same form as Eq. (27a), but we do not immediately know 
the transform of the first term. 

Fortunately, a comparison of the first and third derivatives reveals that the first 
term of d^v(f)/dP can be written as w(t) = —(a/7)? dv(t)dt. Therefore, W(f) = 
— (ary (j20f)V(f) and Eq. (27b) gives 








TVA 


Gom) = -(S Juasmwn + (ZY Stern - emm 


v(t) 


A 








(b) 


Ivf)! 
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nN] > 


AT 
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1/27 


Figure 2.5-7 


(d 


Raised cosine pulse. (a) Waveform; (b) derivatives; (c) amplitude spectrum. 
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EXERCISE 2.5-3 





2.1-1 


2.1-2 


2.1-3 
2.1-4 
2.1-5 


2.1-6 


CHAPTER2 € Signals and Spectra 


Routine manipulations finally produce the result 
JA sin 2rfr | Ar sinc 2f7 
janf + (r/m (jmf) 1- (2fr)? 


whose amplitude spectrum is sketched in Fig. 2.5—7c for f = 0. Note that |V(f)| has 
a third-order rolloff (n = 3), whereas a rectangular pulse with |V(f)| = |sinc f7| = 
K(sin afr) (mrfr)| would have only a first-order rolloff. 


Vr) 


Let v(t) = QAt/7)II(t/r). Sketch dv(t)/dt and use Eq. (27) to find V( f). 


2.6 PROBLEMS 
Consider the phasor signal v(t) = Ae/?e?""/v, Confirm that Eq. (14) yields just one 
nonzero coefficient c,, having the appropriate amplitude and phase. 


If a periodic signal has the even-symmetry property v(—£) = v(0), then Eq. (14) 
may be written as 


T2 
ge | v(t) cos (2ant/To) dt 
To Jy 


Use this expression to find c, when v(f) = A for |t| < Ty4 and v(t) = —A for Ty4 < 
l| < Ty2. As a preliminary step you should sketch the waveform and determine cy 
directly from (ur). Then sketch and label the spectrum after finding c,. 


Do Prob. 2.1-2 with v() = A — 2A|I|/T, for |t| < Ty2. 
Do Prob. 2-1-2 with v(£) = A cos (27rt/Tp) for |t] < Ty2. 


If a periodic signal has the odd-symmetry property u(—1) = —v(f), then Eq. (14) 
may be written as 


> [72 
gm | u(t) sin (2arnt/T,) dt 
To o 


Use this expression to find c, when v(t) = A for 0 < t < Ty2 and v( ^ —A for 
—Ty/2 < t < 0. As a preliminary step you should sketch the waveform and deter- 
mine c directly from (v(f)). Then sketch and label the spectrum after finding c,- 


Do Prob. 2.1-5 with v(t) = A sin(27t/Tp) for |t| < Ty2. 


*Indicates answer given in the back of the book. 
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2.1-8* 


2.1-9 


2.1-10 
2.1-11¥ 


2.1-12t 


2.2-1 


2.22 


2.2-3 


2.2-4* 
2.2-5 
2.2-6* 


2.2-7 


2.2-8 


2.2-9 
2.2-10* 
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Consider a periodic signal with the half-wave symmetry property (v(ttTy2) = 
—v(t)), so the second half of any period looks like the first half inverted. Show that 
C, = 0 for all even harmonics. 


Use Parseval's power theorem to calculate the average power in the rectangular 
pulse train with 7/T, = 1/4 if all frequencies above |f| > 1/7 are removed. Repeat 
for the cases where all frequencies above |f| > 2/r and |f| > 1/27 are removed. 


Let v(£) be the triangular wave with even symmetry listed in Table T.2, and let v'(2) 
be the approximating obtained with the first three nonzero terms of the trigonomet- 
ric Fourier series. (a) What percentage of the total signal power is contained in v’ (£)? 
(b) Sketch v'(t) for |t| < Ty2. 


Do Prob. 2.1—9 for the square wave in Table T.2. 


Calculate P for the sawtooth wave listed in Table T.2. Then apply Parseval's power 
theorem to show that the infinite sum 1/1? + 1/2? + 1/3? + - - - equals 7/6. 


Calculate P for the triangular wave listed in Table T.2. Then apply Parseval's power 
theorem to show that the infinite sum 1/14 + 1/34 + 1/5* + - - - equals 774/96. 


Consider the cosine pulse v(t) = Acos(mi/r)II(t/r). Show that V(f) = (Ar/2) 


[sinc(fr — 1/2) + sinc(fr + 1/2]. Then sketch and label |V(f)| for f = 0 to verify 
reciprocal spreading. 


Consider the sine pulse v(t) = Asin(27t/r)II(t/r). Show that V(f) = —j(AT/2) 
[sinc(fr — 1) — sinc(fr + 1)]. Then sketch and label |V(f)| for f = 0 to verify recip- 
rocal spreading. 


Find V(f) when v(t) = (A — Alt|/r)II(t/27). Express your result in terms of the sinc 
function. 


Find V(f) when v(?) = (At/7)II(t/27). Express your result in terms of the sinc function. 
Use Rayleigh's theorem to calculate the energy in the signal u(t) = sinc2Wt. 


Let v(t) be the causal exponential pulse in Example 2.2-2. Use Rayleigh’s theorem 
to calculate the percentage of the total energy contained in |f| < W when W = b/27 
and W = 2b/zr. 


Suppose the left-hand side of Eq. (17) had been written as 


|. v(t)w(t) dt 


—oo 


Find the resulting right-hand side and simplify for the case when v(ż) is real and 


w(t) = v(t). 


Show that #[w*()] = W '(—f). Then use Eq. (17) to obtain a frequency-domain 
expression for f^^ v(t)z(t)dt. 


Use the duality theorem to find the Fourier transform of v(t) = sinc2t/z. 


Apply duality to the result of Prob. 2.2-1 to find z(t) when Z(f) = Acos(zf/2W) 
TICf/2W). 
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Apply duality to the result of Prob. 2.2-2 to find z(t) when Z(f) = —jAsin(m//W) 
ICf/2W). 


Use Eq. (16) and a known transform pair to show that 
| (a? + x?y? dx = m/4a? 
0 


Let v(t) be the rectangular pulse in Fig. 2.2-1a. Find and sketch Z(f) for z(t) = 
v(t — T) + v(t + T) taking 7 <T. 


Repeat Prob. 2.3—1 for z(t) = u(t — 2T) + 2u(t) + v(t + 2T). 
Repeat Prob. 2.3—1 for z(t) = v(t — 2T) — 2u(t) + v(t + 2T). 
Sketch v(7) and find V(f) for 


v(t) = an(! Lj E an( ER 


Sketch v(£) and find V(f) for 


v(t) = an( a.) + su(! =) 


Find Z(f) in terms of V(f) when z(t) = v(at — ty). 

Prove Eq. (6) (p. 48). 

Use Eq. (7) to obtain the transform pair in Prob. 2.2-1. 

Use Eq. (7) to obtain the transform pair in Prob. 2.2-2. 

Use Eq. (7) to find Z(f) when z(t) = Ae™ cos wt. 

Use Eq. (7) to find Z(f) when z(t) = Ae™ sin w,t for t = 0 and z(t) = 0 for t < 0. 
Use Eq. (12) to do Prob. 2.2-4. 

Use Eq. (12) to find Z(f) when z(t) = Ate-*, 

Use Eq. (12) to find Z(f) when z(t) = Ate™ for t = 0 and z(t) = O fort < 0. 


Consider the Gaussian pulse listed in Table T.1. Generate a new transform pair by: 
(a) applying Eq. (8) with n = 1; (b) applying Eq. (12) with n = 1. 


Find and sketch y(t) = v'w(t) when u(t) = t for 0 < t < 2 and w(t) =A for t > 0. 
Both signals equal zero outside the specified ranges. 


Do Prob. 2.4—1 with w(t) = Afor0 «t «3. 
Do Prob. 2.4—1 with w() = AforO « £ « 1. 


Find and sketch y(t) = v*w(t) when v(t) = 211(5*), w(t) = A for t = 4, and 
w(t) = 0 otherwise. 
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Do Prob. 2.4-4 with w(t) = e^? for t > 0 and w(t) = 0 otherwise. 
Do Prob. 2.4-4 with w(t) = A(£). 


Find y(t) = v*w(t) for v(t) = Ae^* for t > 0 and w(t) = Be" for t > 0. Both signals 
equal zero outside the specified ranges. 


Do Prob. 2.4—7 with w(t) = sin wt for 0 € t x 2, w(t) = 0 otherwise. (Hint: express 
sinusoid as a sum of exponentials) 


Prove Eq. (2a) from Eq. (1) (p. 53). 


Let u(t) and w(t) have even symmetry. Show from Eq. (1) that v'w(f) will have even 
symmetry. 

Let u(t) and w(t) have odd symmetry. Show from Eq. (1) that v*w(¢) will have odd 
symmetry. 

Find and sketch v'v'u when v(t) = II(7). You may use the symmetry property 
stated in Prob. 2.4—10. 

Use Eq. (3) to prove Eq. (2b) (p. 55). 

Find and sketch y(t) = v*w(t) when v(f) = sinc 4t and w(t) = 2 sinc 5. 


Consider the signal z(t) and its transform Z(f) from Example 2.3-2. Find z( and 
Z(f) as T 3 0. 


Let v(t) be a periodic signal whose Fourier series coefficients are denoted by c,(nfo). 
Use Eq. (14) and an appropriate transform theorem to express c,(nf5) in terms of 
c,(nfo) when w(t) = u(t — ty). 


Do Prob. 2.5-2 with w(t) = du(t)/dt. 
Do Prob. 2.5-2 with w(t) = v(t) cos mwof. 


Let v(t) = A for 0 < t < 2r and v(?) = 0 otherwise. Use Eq. (18) to find V(f). Check 
your result by writing v(f) in terms of the rectangle function. 

Let v(t) = A for |t| > v and v(t) = 0 otherwise. Use Eq. (18) to find V(f). Check 
your result by writing v(f) in terms of the rectangle function. 

Let v(t) = A for t < —T, and u(t) = —A for t > T, and u(t) = 0 otherwise. Use Eq. 
(18) to find V( f). Check your result by letting T — 0. 

Let 


w(t) — | v(A) da 


—Cco 


with v(t) = (1/e)II(t/e). Sketch w(t) and use Eq. (20) to find W(f). Then let e > 0 
and compare your results with Eq. (18). 


Do Prob. 2.5-8 with v(t) = (1/e)e "^ u(t). 


Obtain the transform of the signal in Prob. 2.3-1 by expressing z(t) as the convolu- 
tion of u(t) with impulses. 
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Do Prob. 2.5—10 for the signal in Prob. 2.3-2. 
Do Prob. 2.5--10 for the signal in Prob. 2.3-3. 
8 


Find and sketch the signal v(t) = > sin (27t)6(t — 0.5n) using Eq. (9a). 
n=0 
10 

Find and sketch the signal v(t) = $, cos (2mt)é(t — 0.17) using Eq. (9a). 


n--10 
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S transmission is the process whereby an electrical waveform gets from one location to another, ideally arriv- 
ing without distortion. In contrast, signal filtering is an operation that purposefully distorts a waveform by altering 
its spectral content. Nonetheless, most transmission systems and filters have in common the properties of linearity and 
time invariance. These properties allow us to model both transmission and filtering in the time domain in terms of the 
impulse response, or in the frequency domain in terms of the frequency response. 

This chapter begins with a general consideration of system response in both domains. Then we'll apply our 
results to the analysis of signal transmission and distortion for a variety of media and systems such as fiber optics 
and satellites. We'll examine the use of various types of filters and filtering in communication systems. Some related 
topics—notably transmission loss, Hilbert transforms, and correlation—are included as starting points for subse- 
quent development. 


OBJECTIVES 





After studying this chapter and working the exercises, you should be able to do each of the following: 
1. State and apply the input-output relations for an LTI system in terms of its impulse response A(z), step response 
g(t), or transfer function H(f) (Sect. 3.1). 
2. Use frequency-domain analysis to obtain an exact or approximate expression for the output of a system (Sect. 3.1). 
3. Find H(f) from the block diagram of a simple system (Sect. 3.1). 
4. Distinguish between amplitude distortion, delay distortion, linear distortion, and nonlinear distortion (Sect. 3.2). 
5 


Identify the frequency ranges that yield distortionless transmission for a given channel, and find the equalization 
needed for distortionless transmission over a specified range (Sect. 3.2). 


6. Use dB calculations to find the signal power in a cable transmission system with amplifiers (Sect. 3.3). 
7. Discuss the characteristics of and requirements for transmission over fiber optic and satellite systems (Sect. 3.3). 
8. Identify the characteristics and sketch H(f) and h(t) for an ideal LPF, BPF, or HPF (Sect. 3.4). 
9. Find the 3-dB bandwidth of a real LPF, given H(f) (Sect. 3.4). 
10. State and apply the bandwidth requirements for pulse transmission (Sect. 3.4). 
li. State and apply the properties of the Hilbert transform (Sect. 3.5). 


12. Define the crosscorrelation and autocorrelation functions for power or energy signals, and state their properties 
(Sect. 3.6). 


13. State the Wiener-Kinchine theorem and the properties of spectral density functions (Sect. 3.6). 


14. Given H(f) and the input correlation or spectral density function, find the output correlation or spectral density 
(Sect. 3.6). 





3.1 RESPONSE OF LTI SYSTEMS 


Figure 3.1-1 depicts a system inside a "black box" with an external input signal 
x(t) and an output signal y(t). In the context of electrical communication, the sys- 
tem usually would be a two-port network driven by an applied voltage or current at 
the input port, producing another voltage or current at the output port. Energy stor- 


d 


3.1 Response of LTI Systems 


Black box 
Input Output 


age elements and other internal effects may cause the output waveform to look quite 
different from the input. But regardless of what's in the box, the system is character- 
ized by an excitation-and-response relationship between input and output. 

Here we're concerned with the special but important class of linear time- 
invariant systems—or LTI systems for short. We'll develop the input-output rela- 
tionship in the time domain using the superposition integral and the system's 
impulse response. Then we'll turn to frequency-domain analysis expressed in terms 
of the system's transfer function. 


Figure 3.1-1 


Impulse Response and the Superposition Integral 


Let Fig. 3.1-1 be an LTI system having no internal stored energy at the time the 


input x(t) is applied. The output y(t) is then the forced response due entirely to x(t), 
as represented by 


y(t) = F[x(t)] [1] 


where F[x(1)] stands for the functional relationship between input and output. The 
linear property means that Eq. (1) obeys the principle of superposition. Thus, if 


x(t) = 5 Ak xt) [2a] 
k 
where a, are constants, then 
y(t) = 2a, Fiy] [25] 
k 


The time-invariance property means that the system’s characteristics remain fixed 
with time. Thus, a time-shifted input x(t — t4) produces 


F[x(t — ta)] = y(t — ta) [3] 
so the output is time-shifted but otherwise unchanged. 

Most LIT systems consist entirely of lumped-parameter elements (such as 
resistors capacitors, and inductors), as distinguished from elements with spatially 
distributed phenomena (such as transmission lines). Direct analysis of a lumped- 
parameter system starting with the element equations leads to the input-output rela- 
tion as a linear differential equation in the form 

d^y(t dy(t d"x(t dx(t 
X0 oe) egaga 


a -+a + agy(t) =b +b 
" dt” ES ao Xt) = bs dt^ | dt 











T bo x(t) [4] 
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where the a’s and b’s are constant coefficients involving the element values. The 
number of independent energy-storage elements determines the value of n, known 
as the order of the system. Unfortunately, Eq. (4) doesn't provide us with a direct 
expression for y(t). 


To obtain an explicit input-output equation, we must first define the system's 
impulse response 


A(t) = F[8(t)] [5] 


which equals the forced response when x(t) = 8(t). But any continuous input signal 
can be written as the convolution x(t) = x(t) * 8(t), so 


y(t) = r| | x(A)8(t — A) an 


= | “AAFS — AJ] d 


in which the interchange of operations is allowed by virtue of the system's linearity. 
Now, from the time-invariance property, F[5(t — A)] = A(t — A) and hence 


y(t) = | x(A)A(t — A) dA [6a] 
z | h(A)x(t — A) dÀ [6b] 


where we have drawn upon the commutivity of convolution. 

Either form of Eq. (6) is called the superposition integral. It expresses the 
forced response as a convolution of the input x(f) with the impulse response A(t). 
System analysis in the time domain therefore requires knowledge of the impulse 
response along with the ability to carry out the convolution. 

Various techniques exist for determining A(t) from a differential equation or 
some other system model. But you may be more comfortable taking x(t) = u(t) and 
calculating the system's step response 


a(t) = F[u(t)] [7a] 
from which () 
dg(t 

Myr [7b] 


This derivative relation between the impulse and step response follows from the 
general convolution property 
d dw(t 
di [o x w(t)] = v(t) « E 


dt 
Thus, since g(t) = h *u(f) by definition, dg(n/dt = h(t) « [du(tydt] = h(t) * 6) = h(t). 
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Time Response of a First-Order System Dane EXAMPLE 3.1-1 


The simple RC circuit in Fig. 3.1-2 has been arranged as a two-port network with 
input voltage x(t) and output voltage y(t). The reference voltage polarities are indi- 
cated by the +/— notation where the assumed higher potential is indicated by the + 
sign. This circuit is a first-order system governed by the differential equation 


dy(t) 


RC —— 
dt 


+ y(t) = x(t) 
Similar expressions describe certain transmission lines and cables, so we’re particu- 
larly interested in the system response. 

From either the differential equation or the circuit diagram, the step response is 
readily found to be 


g(t) = (1 — e#°)u(t) [8a] 


Interpreted physically, the capacitor starts at zero initial voltage and charges toward 
y(co) = 1 with time constant RC when x(t) = u(t). Figure 3.1—3a plots this behav- 
ior, while Fig. 3.1-3b shows the corresponding impulse response 


h(t) = aoe u(t [65] 
obtained by differentiating g(t). Note that g(t) and h(t) are causal waveforms since 
the input equals zero fort < 0. 

The response to an arbitrary input x(t) can now be found by putting Eq. (85) in 
the superposition integral. For instance, take the case of a rectangular pulse applied 
at t = 0, so x(t) = A for 0 < t < rt. The convolution y(t) = A » x(t) divides into 
three parts, like the example back in Fig. 2.4—1 with the result that 


0 t<0 
y(t) = 4 A(1 — e^" C) 0crt«c [9] 
A(1 = g^ THRE e eye t>r 


as sketched in Fig. 3.1—4 for three values of 7/RC. 


i x(t) C x 
H | 


Figure 3.1-2 RC lowpass filter. 
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Figure 3.1-3 Output of an RC lowpass filter. (a) Step response; {b} impulse response. 





Figure 3.1-4 Rectangular pulse response of an RC lowpass filter. (a) r >> RC; {b} 7 = RC; 
(c) 7 « RC. 





EXERCISE 3. 1-1 Let the resistor and the capacitor be interchanged in Fig. 3.1-2. Find the step and 
impulse response. 





Transfer Functions and Frequency Response 


Time-domain analysis becomes increasingly difficult for higher-order systems, and 
the mathematical complications tend to obscure significant points. We'll gain a dif- 





demam 
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ferent and often clearer view of system response by going to the frequency domain. 
As a first step in this direction, we define the system transfer function to be the 
Fourier transform of the impulse response, namely, 


H(f) Ê 9(h(0] = | T WeI? at [10] 


~o 


This definition requires that H( f) exists, at least in a limiting sense. In the case of an 
unstable system, h(t) grows with time and H(f) does not exist. 
When A(t) is a real time function, H(f) has the hermitian symmetry 


H(-f) = H*(f) (Val 
so that 
|A(-f)|=|A(f)| | arg (7f) = —arg H( f) [115] 


We’ ll assume this property holds unless otherwise stated. 


The frequency-domain interpretation of the transfer function comes from 
y(t) = h * x(t) with a phasor input, say 


x(t) = Aeh eih! "Og e fw œ [12a] 


The stipulation that x(t) persists for all time means that we're dealing with steady- 


state conditions, like the familiar case of ac steady-state circuit analysis. The steady- 
state forced response is 


y(t) = | h(A)A , eJ enfe) gy 


~o 


co 
= | | Haye Pea A, e^ eirfa 


—co 


= H(fo)A. ett: e J2nfot 


where, from Eq. (10), H(fo) equals H(f) with f = fy. Converting H(fp) to polar 
form then yields 


y(t) = Ape eP =o < t < oo [12b] 
in which we have identified the output phasor's amplitude and angle 
Ay=|H(f)|A, $y = arg H(fo) * 6. 113] 
Using conjugate phasors and superposition, you can similarly show that if 
x(t) = A, cos (2mfot + p.) 
then 
y(t) = A, cos (Zufot + $y) 
with A, and $, as in Eq. (13). 
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Since A,/A, = | H(fo)| at any frequency fo, we conclude that | (f ) | represents 
the system's amplitude ratio as a function of frequency (sometimes called the 
amplitude response or gain). By the same token, arg H(f) represents the phase 
shift, since p, — $, = arg H(fo). Plots of | H(f)| and arg H(f) versus frequency 
give us the frequency-domain representation of the system or, equivalently, the sys- 
tem's frequency response. Henceforth, we'll refer to H(f) as either the transfer 
function or frequency-response function. 

Now let x(f) be any signal with spectrum X(f). Calling upon the convolution 
theorem, we take the transform of y(t) = A * x(t) to obtain 


YF) = Af )X(f) [14] 








This elegantly simple result constitutes the basis of frequency-domain system analy- 
sis. It says that 





The corresponding amplitude and phase spectra are 


ICA) = UCOJMXCEI 
arg Y( f) = arg H(f) + arg X(f) 


which compare with the single-frequency expressions in Eq. (13). If x(t) is an 
energy signal, then y(t) will be an energy signal whose spectral density and total 
energy are given by 


I 


IY P) = LCA YP)? [15a] 
B= [^ IBCOPIRCOP df m 


as follows from Rayleigh's energy theorem. 

Equation (14) sheds new light on the meaning of the system transfer function 
and the transform pair h(t) <> H(f). For if we let x(t) be a unit impulse, then 
X(f) = 1 and Y(f) = A(f)—in agreement with the definition y(t) = A(t) when 

x(t) = e(t). From the frequency-domain viewpoint, the "flat" input spectrum 
X(f) = 1 contains all frequencies in equal proportion and, consequently, the output 
spectrum takes on the shape of the transfer function H(/). 

Figure 3.1-5 summarizes our input-output relations in both domains. Clearly, 
when H(f) and X(f) are given, the output spectrum Y(f) is much easier to find than 
the output signal y(t). In principle, we could compute y(f) from the inverse transform 

foo 
y(t) = FAXE) = | HOAX fe" df 
-00 
But this integration does not necessarily offer any advantages over time-domain 
convolution. Indeed, the power of frequency-domain system analysis largely 
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Input System Output 
x(t) h(t) y(t) = h « x(t) 
X(f) HG) KF) = HPX) 
Figure 3.1-5 Input-output relations for an LTI system. 


depends on staying in that domain and using our knowledge of spectral properties to 
draw inferences about the output signal. 


Finally, we point out two ways of determining H(f) that don’t involve A(t). If 
you know the differential equation for a lumped-parameter system, you can imme- 
diately write down its transfer function as the ratio of polynomials 


b,(j2mf)" + > + bi(J2mf) + bo 
an j2mf)" + + + a,(j2af) + ao 
whose coefficients are the same as those in Eq. (4). Equation (16) follows from 
Fourier transformation of Eq. (4). 


Alternatively, if you can calculate a system’s steady-state phasor response, 
Eqs. (12) and (13) show that 





A(f) = [16] 


H(f)- xe) when = x(t) = e?" [17] 
x(t) 
This method corresponds to impedance analysis of electrical circuits, but is equally 
valid for any LTT system. Furthermore, Eq. (17) may be viewed as a special case of 
the s domain transfer function H(s) used in conjunction with Laplace transforms. 
Since s = o + jw in general, H( f) is obtained from H(s) simply by letting s = j2af. 
These methods assume, of course, that the system is stable. 





Frequency Response of a First-Order System EXAMPLE 3.1-2 


The RC circuit from Example 3.1-1 has been redrawn in Fig. 3.1-6a with the 
impedances Zp = R and Ze = l/joC replacing the elements. Since y(t)/x(t) = 
Zcel(Zc + Zp) when x(t) = e/*', Eq. (17) gives 





2 Q/j2sfC) —.— 1 
BU (1/j2mfC) +R 1 + j2mfRC 
— M [18a] 
1 cJ B) 


where we have introduced the system parameter 


l 
2r RC 


Io 





B [18b] 
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Identical results would have been obtained from Eq. (16), or from H(f ) = #[h(t)]. 
(In fact, the system's impulse response has the same form as the causal exponential 
pulse discussed in Example 2.2—2.) The amplitude ratio and phase shift are 

1 


PAESE CPU 


as plotted in Fig. 3.1-6b for f = 0. The hermitian symmetry allows us to omit 
f < 0 without loss of information. 

The amplitude ratio | H(f)| has special significance relative to any frequency- 
selective properties of the system. We call this particular system a lowpass filter 
because it has almost no effect on the amplitude of low-frequency components, say 
|f| << B, while it drastically reduces the amplitude of high-frequency components, 
say |f| >> B. The parameter B serves as a measure of the filter’s passband or 
bandwidth. 

To illustrate how far you can go with frequency-domain analysis, let the input x(t) 
be an arbitrary signal whose spectrum has negligible content for |f | > W. There are 
three possible cases to consider, depending on the relative values of B and W: 


arg H( f) = -arctan Z [18d 


1. IfW < B,as shown in Fig. 3.1—7a, then |H(f)| ~ 1 and arg H(f ) ~ O over the 
signal’s frequency range | f| < W. Thus, Y(f) = H(f)X(f) ~ X(f) and y(t) = 
x(t)so we have undistorted transmission through the filter. 

2. If W - B, as shown in Fig. 3.1—7b, then Y( f ) depends on both H(f) and X(f). 
We can say that the output is distorted, since y(t) will differ significantly from 
x(t), but time-domain calculations would be required to find the actual waveform. 


IHC fy 


1.0 
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Figure 3.1-6 RC lowpass filter.(a) circuit; (b) transfer function. 
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lH( f | 
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IY(f)! 
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Figure 3.1-7 Frequency-domain analysis of a first-order lowpass filter. (a) B >> W; (b) B = 
W; (c) BK W. 


3. If W > B, as shown in Fig. 3.1—7c, the input spectrum has a nearly constant 
value X(0) for |f| < B so Y(f£) ~ X(0)H(f). Thus, y(t) ^ X(0)A(t), and the 
output signal now looks like the filter's impulse response. Under this condition, 
we can reasonably model the input signal as an impulse. 


Our previous time-domain analysis with a rectangular input pulse confirms 
these conclusions since the nominal spectral width of the pulse is W — 1/7. The 
case W «& B thus corresponds to 1/7 << 1/23RC or t/RC >> 1, and we see in 
Fig. 3.1-4a that y(t) = x(t). Conversely, W >> B corresponds to 7/RC «& 1 as in 
Fig. 3.1-4c where y(t) looks more like k(t). 





Find H(f ) when Z; = joL replaces Zç in Fig. 3.1-6a. Express your result in terms 
of the system parameter fp = R/27L, and justify the name “highpass filter" by 
sketching | H( f )| versus f. 
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Block-Diagram Analysis 


More often than not, a communication system comprises many interconnected 
building blocks or subsystems. Some blocks might be two-port networks with 
known transfer functions, while other blocks might be given in terms of their time- 
domain operations. Any LTI operation, of course, has an equivalent transfer func- 
tion. For reference purposes, Table 3.1—1 lists the transfer functions obtained by 
applying transform theorems to four primitive time-domain operations. 











Table 3.1-1 
Time-Domain Operation Transfer Function 
Scalar multiplication y(t) = +Kx(t) H(f) = £K 

dx(t) 
Differentiation y(t) = ae H(f) = jaf 

‘ 1 

Integration y(t) = | w dà H(f) = Raf 
Time delay y(t) = x(t — ty) A(f) em 


When the subsystems in question are described by individual transfer functions, it 
is possible and desirable to lump them together and speak of the overall system trans- 
fer function. The corresponding relations are given below for two blocks connected in 
parallel, cascade, and feedback. More complicated configurations can be analyzed by 
successive application of these basic rules. One essential assumption must be made, 
however, namely, that any interaction or loading effects have been accounted for in the 
individual transfer functions so that they represent the actual response of the subsys- 
tems in the context of the overall system. (A simple op-amp voltage follower might be 
used to provide isolation between blocks and prevent loading.) 

Figure 3.1-8a diagrams two blocks in parallel: both units have the same input 
and their outputs are summed to get the system’s output. From superposition it fol- 
lows that ¥(f) = (Hi(f£) + H4(f)]X(f ) so the overall transfer function is 


H(f) = Hf) + Hf) Parallel connection [19a] 


In the cascade connection, Fig. 3.1—-85, the output of the first unit is the input to the 


second, so Y (f) = Hf )LH(f JX(f )] and 
H(f) = H(f)HXf) Cascade connection [195] 


The feedback connection, Fig. 3.1-8c, differs from the other two in that the output 
is sent back through H,(f) and subtracted from the input. Thus, 


Y(f) = HX) 7 HAAY) 
and rearranging yields ¥(f) = {H (f )/[1 + Hf HU )])XCf) so 
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HG) X(f) 
Hf) 
X(f) Ld YGS) = (ACF) + Af) XC) 
HAF) 
HX f) XCf) 
(a) 
X(f) HXO) YCf) = HGORCP) XCf) 
Hif) HAS) 
(b) 
X(f) + Hif) 
H Y(f) = —————— Xi 
O - H) CP) Le HH (f) 
E HXf) 
HX f) Xf) 
(c) : 
Figure 3.1-8 (a) Parallel connection; (b) cascade connection; (c) feedback connection. 
Hif) 
Af) = —————— Feedback connection [19d 
DOT TERGJH) 


This case is more properly termed the negative feedback connection as distin- 
guished from positive feedback, where the returned signal is added to the input 
instead of subtracted. 





Zero-Order Hold 


The zero-order hold system in Fig. 3.1-9a has several applications in electrical com- 
munication. Here we take it as an instructive exercise of the parallel and cascade 
relations. But first we need the individual transfer functions, determined as follows: 
the upper branch of the parallel section is a straight-through path so, trivially, 
H,(f) = 1; the lower branch produces pure time delay of T seconds followed by 
sign inversion, and lumping them together gives H,(f) = —e 7?"7; the integrator 
in the final block has H4(f) = 1/j27f. Figure 3.1-9b is the equivalent block dia- 
gram in terms of these transfer functions. 

Having gotten this far, the rest of the work is easy. We combine the parallel 
branches in H,.(f) = Hf) + Hf) and use the cascade rule to obtain 


EXAMPLE 3.1-3 
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x(t) 








Figure 3.1-9 Block diagrams of a zero-order hold. (a) Time domain; {b} frequency domain. 


Af) = Hy(£)H4G) = (HG) + Af) IAS) 





; 1 
—[1—277T 
1 j2uf 
inff ^ ,—jmfT 3 
= e! e? QUIT — sin TfT E 
Jaf Tf 


I 


T sinc f Te IF 
Hence we have the unusual result that the amplitude ratio of this system is a sinc 
function in frequency! 

To confirm this result by another route, let's calculate the impulse response h(t) 
drawing upon the definition that y(t) = A(t) when x(t) = 6(1). Inspection of Fig. 3.1-9a 
shows that the input to the integrator then is x(t) — x(t — T) = ó(t) — ó(t — T), so 


h(t) = | [8(A) — &(A — T)] dA = u(t) — u(t — T) 


-—O00 


which represents a rectangular pulse starting at t = 0. Rewriting the impulse 
response as h(t) = II[(r — T/2)/T] helps verify the transform relation 
h(t) <> H(f). 
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3.2 SIGNAL DISTORTION IN TRANSMISSION 


A signal transmission system is the electrical channel between an information 
source and destination. These systems range in complexity from a simple pair of 
wires to a sophisticated laser-optics link. But all transmission systems have two 
physical attributes of particular concern in communication: internal power dissipa- 
tion that reduces the size of the output signal, and energy storage that alters the 
shape of the output. 

Our purpose here is to formulate the conditions for distortionless signal trans- 
mission, assuming an LTI system so we can work with its transfer function. Then 
we'll define various types of distortion and address possible techniques for mini- 
mizing their effects. 


Distortionless Transmission 


Distortionless transmission means that the output signal has the same "shape" as the 
input. More precisely, given an input signal x(t), we say that 





Analytically, we have distortionless transmission if 


y(t) = Kx(t — tj) [1 
where K and t, are constants. 


The properties of a distortionless system are easily found by examining the out- 
put spectrum 


Y(f) = 9[y()] = Ke ""X(f) 
Now by definition of transfer function, Y(f) = H(f)X(f), so 


H(f) = Ke" [2a] 


In words, a system giving distortionless transmission must have constant amplitude 
response and negative linear phase shift, so 


IH(f)) 2 |K| arg H(f) = —2rtyf + m180° [2b] 


Note that arg H(f) must pass through the origin or intersect at an integer multiple of 
+180°. We have added the term + m180° to the phase to account for K being posi- 
tive or negative. In the case of zero time delay, the phase is constant at 0 or + 180°. 
An important and rather obvious qualification to Eq. (2) should be stated imme- 
diately. The conditions on H(f) are required only over those frequencies where the 
input signal has significant spectral content. To underscore this point, Fig. 3.2-1 
shows the energy spectral density of an average voice signal obtained from labora- 
tory measurements. Since the spectral density is quite small for f « 200 Hz and 
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XC f I? 
t SSS =) oh Ix f 
0 200 3200 
Figure 3.2-1 Energy spectral density of an average voice signal. 


f > 3200 Hz, we conclude that a system satisfying Eq. (2) over 200 = |f| = 3200 
Hz would yield nearly distortion-free voice transmission. Similarly, since the human 
ear only processes sounds between about 20 Hz and 20,000 Hz, an audio system that 
is distortion free in this range is sufficient. 

However, the stringent demands of distortionless transmission can only be sat- 
ished approximately in practice, so transmission systems always produce some 
amount of signal distortion. For the purpose of studying distortion effects on various 
signals, we’ ll define three major types of distortion: 


1. Amplitude distortion, which occurs when 
|H(f)| # |K| 
2. Delay distortion, which occurs when 
arg H(f) # —2vt,f + m180° 


3. Nonlinear distortion, which occurs when the system includes nonlinear ele- 
ments 


The first two types can be grouped under the general designation of linear distor- 
tion, described in terms of the transfer function of a linear system. For the third type, 
the nonlinearity precludes the existence of a transfer function. l 





EXAMPLE 3.2-1 Suppose a transmission system has the frequency response plotted in Fig. 3.2-2. 


This system satisfies Eq. (2) for 20 < | f | = 30 kHz. Otherwise, there's amplitude 
distortion for |f| < 20kHz and |f| > SOKHz, and delay distortion for 
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1.0 
FC) 
t f£ kHz 
0 20 30 50 
0 ; 
arg H( f) l 
-90° 
Figure 3.2-2 


Linear Distortion 


Linear distortion includes any amplitude or delay distortion associated with a linear 
transmission system. Amplitude distortion is easily described .in the frequency 
domain; it means simply that the output frequency components are not in correct 
proportion. Since this is caused by | H(f ) | not being constant with frequency, ampli- 
tude distortion is sometimes called frequency distortion. 

The most common forms of amplitude distortion are excess attenuation or 
enhancement of extreme high or low frequencies in the signal spectrum. Less com- 
mon, but equally bothersome, is disproportionate response to a band of frequencies 
within the spectrum. While the frequency-domain description is easy, the effects in 
the time domain are far less obvious, save for very simple signals. For illustration, a 
suitably simple test signal is x(t) = cos wot — 1/3 cos 3wot + 1/5 cos Swot, a 
rough approximation to a square wave sketched in Fig. 3.2-3. If the low-frequency 
or high-frequency component is attenuated by one-half, the resulting outputs are as 
shown in Fig. 3.2-4. As expected, loss of the high-frequency term reduces the 
"sharpness" of the waveform. 


Af 


Figure 3.2-3 Test signal x(t) = cos wot — 1/3 cos 3wot + 1/5 cos Swot. 
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i 1 
0 t 0 t 
(a) (bi 


Figure 3.2-4 


Test signal with amplitude distortion. (a) Low frequency attenuated; (b) high frequency attenuated. 


Beyond qualitative observations, there’s little more we can say about amplitude 
distortion without experimental study of specific signal types. Results of such stud- 
ies are usually couched in terms of required “flat” frequency response—meaning 
the frequency range over which |H(f)| must be constant to within a certain toler- 
ance so that the amplitude distortion is sufficiently small. 

We now turn our attention to phase shift and time delay. If the phase shift is not 
linear, the various frequency components suffer different amounts of time delay, and 
the resulting distortion is termed phase or delay distortion. For an arbitrary phase 
shift, the time delay is a function of frequency and can be found by writing 
arg H(f) = —2rft,(f ) with all angles expressed in radians. Thus 


arg Hf) 
2mf 


which is independent of frequency only if arg H(f) is linear with frequency. 

A common area of confusion is constant time delay versus constant phase 
shift. The former is desirable and is required for distortionless transmission. The lat- 
ter, in general, causes distortion. Suppose a system has the constant phase shift 0 
not equal to 0? or +m180°. Then each signal frequency component will be delayed 
by 6/27 cycles of its own frequency; this is the meaning of constant phase shift. But 
the time delays will be different, the frequency components will be scrambled in 
time, and distortion will result. 

That constant phase shift does give distortion is simply illustrated by returning 
to the test signal of Fig. 3.2—3 and shifting each component by one-fourth cycle, 0 = 
—90°. Whereas the input was roughly a square wave, the output will look like the 
triangular wave in Fig. 3.2—5. With an arbitrary nonlinear phase shift, the deteriora- 
tion of waveshape can be even more severe. 

You should also note from Fig. 3.2—5 that the peak excursions of the phase- 
shifted signal are substantially greater (by about 50 percent) than those of the input 
test signal. This is not due to amplitude response, since the output amplitudes of the 
three frequency components are, in fact, unchanged; rather, it 1s because the compo- 
nents of the distorted signal all attain maximum or minimum values at the same 
time, which was not true of the input. Conversely, had we started with Fig. 3.2—5 as 


t(f) = [3] 
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Figure 3.2-5 Test signal with constant phase shift @ = —90°. 


the test signal, a constant phase shift of +90° would yield Fig. 3.2-3 for the output 
waveform. Thus we see that delay distortion alone can result in an increase or 
decrease of peak values as well as other waveshape alterations. 





Let's take a closer look at the impact of phase delay on a modulated signal. The 
transfer function of an arbitrary channel can be expressed as 


H(f) = Ae ?nftté) — (Ae!) ese [4] 


where arg H(f) = —2aft, + bo leads to tdf) = t, — bo/27f from Eq. (3). If the 
input to this bandpass channel is 


x(t) = x,(t) cos wt — x(t) sin w,t [5] 


then by the time delay property of Fourier transforms, the output will be delayed by 


t,. Since e/* can be incorporated into the sine and cosine terms, the output of the 
channel is 


y(t) = Ax(t — tj) cos [w(t — t) + Po] — AxY(t — ty) sin [w(t — t) + do] 
We observe that arg H(f.) = —«t, + do —«,t,so that 
y(t) = Axi(t — t,) cos [w(t — ty)] — Axa(t — t,) sin [ex(t — t4)] [6] 


From Eq. (6) we see that the carrier has been delayed by t, and the signals that mod- 
ulate the carrier, x, and x», are delayed by t,. The time delay t, corresponding to the 
phase shift in the carrier is called the phase delay of the channel. This delay is also 
sometimes referred to as the carrier delay. The delay between the envelope of the 
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input signal and that of the received signal, t,, is called the envelope or group delay 
of the channel. In general, t; 7 t,. 

This leads to a set of conditions under which a linear bandpass channel is dis- 
tortionless. As in the general case of distortionless transmission described earlier, 
the amplitude response must be constant. For the channel in Eq. (4) this implies 
|H(f)| = |A|. In order to recover the original signals x, and x;, the group delay 
must be constant. Therefore, from Eq. (4) this implies that t, can be found directly 
from the derivative of arg H(f) = 06(f) as 


ut) 


| is df [7] 


Note that this condition on arg H(f) is less restrictive than in the general case pre- 
sented earlier. If dy = 0 then the general conditions of distortionless transmission 
are met and tz = f,. 





EXERCISE 3.2-1 Use Eq. (3) to plot t4( f) from arg H(f) given in Fig. 3.2-2. 





Equalization 


Linear distortion—both amplitude and delay—is theoretically curable through the use 
of equalization networks. Figure 3.2-6 shows an equalizer H,,(f) in cascade with a 
distorting transmission channel H-(f). Since the overall transfer function is 
A(f) = He(f)E. (f) the final output will be distortionless if Ho(f)A.g(f) = 
Ke ~/*, where K and t; are more or less arbitrary constants. Therefore, we require that 


Ke ive 
H4) = cg; 





[8) 


wherever X(f ) # 0. 

Rare is the case when an equalizer can be designed to satisfy Eq. (8) exactly— 
which is why we say that equalization is a theoretical cure. But excellent approxi- 
mations often are possible so that linear distortion can be reduced to a tolerable 
level. Probably the oldest equalization technique involves the use of loading coils on 
twisted-pair telephone lines. These coils are lumped inductors placed in shunt across 
the line every kilometer or so, giving the improved amplitude ratio typically illus- 
trated in Fig. 3.2-7. Other lumped-element circuits have been designed for specific 
equalization tasks. 


Channel Equalizer 


J 


Figure 3.2-6 Channel with equalizer for linear distortion. 
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Figure 3.2-7 Amplitude ratio of a typical telephone line with and without loading coils for 
equalization. 


More recently, the tapped-delay-line equalizer, or transversal filter, has 
emerged as a convenient and flexible device. To illustrate the principle, Fig. 3.2-8 
shows a delay line with total time delay 2A having taps at each end and the middle. 


The tap outputs are passed through adjustable gains, c_,, co, and c}, and summed to 
form the final output. Thus 


y(t) = cout) + cox(t — A) + cx(t 2A) [9a] 
and 
Hf) Spade epee gema 
= (cett + cy + ce I) eos [9b] 
Generalizing Eq. (9b) to the case of a 2MA delay line with 2M + 1 taps yields 
M 
Hf) = ( X eem emm [10] 


which has the form of an exponential Fourier series with frequency periodicity 1/A. 
Therefore, given a channel H,(f) to be equalized over |f| < W, you can approxi- 
mate the right-hand side of Eq. (8) by a Fourier series with frequency periodicity 
1/A = W (thereby determining A), estimate the number of significant terms (which 
determines M), and match the tap gains to the series coefficients. 


Tapped delay line 


mf ls 
Adjustable 
gains 


Figure 3.2-8 Transversal filter with three taps. 
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In many applications, the tap gains must be readjusted from time to time to com- 
pensate for changing channel characteristics. Adjustable equalization is especially 
important in switched communication networks, such as a telephone system, since 
the route between source and destination cannot be determined in advance. Sophisti- 
cated adaptive equalizers have therefore been designed with provision for automatic 
readjustment. 

Adaptive equalization is usually implemented with digital circuitry and micro- 
processor control, in which case the delay line may be replaced by a shift register 
or charge-coupled device (CCD). For fixed (nonadjustable) equalizers, the trans- 
versal filter can be fabricated in an integrated circuit using a surface-acoustic-wave 
(SAW) device. 





EXAMPLE 3.2-2 


Multipath Distortion 


Radio systems sometimes suffer from multipath distortion caused by two (or more) 
propagation paths between transmitter and receiver. Reflections due to mismatched 
impedance on a cable system produce the same effect. As a simple example, sup- 
pose the channel output is 
y(t) = Kix(t — tj) + K»x(t — t) 
whose second term corresponds to an echo of the first if t4 > t,. Then 
Hf) = ke + K,e je" nu 
= Kye) + kee”) 

where k = K/K; and ty = t; — ty. 

If we take K = K, and t, = f, for simplicity in Eq. (8), the required equalizer 
characteristic becomes 


lI 1 RUN —jarty 2, —j20tg 
Hall) > iue Oe Ek enea ae ee 
The binomial expansion has been used here because, in this case, it leads to the form 
of Eq. (10) without any Fourier-series calculations. Assuming a small echo, so that 
k? < 1, we drop the higher-power terms and rewrite H.,(f) as 


HA = (eten -k+ kg into) o 7jots 


Comparison with Eqs. (95) or (10) now reveals that a three-tap transversal filter will 
do the job if c_; = 1,c9 = —k, c, = K^, and A = h. 





A 
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Nonlinear Distortion and Companding 


A system having nonlinear elements cannot be described by a transfer function. 
Instead, the instantaneous values of input and output are related by a curve or func- 
tion y(t) = T|x(t)], commonly called the transfer characteristic. Figure 3.2-9 is a 
representative transfer characteristic; the flattening out of the output for large input 
excursions is the familiar saturation-and-cutoff effect of transistor amplifiers. We'll 
consider only memoryless devices, for which the transfer characteristic 1s a com- 
plete description. 

Under small-signal input conditions, it may be possible to linearize the transfer 
characteristic in a plecewise fashion, as shown by the thin lines in the figure. The 
more general approach is a polynomial approximation to the curve, of the form 


y(t) = a x(t) + az x (t) + az x(t) + [12a] 


and the higher powers of x(¢) in this equation give rise to the nonlinear distortion. 
Even though we have no transfer function, the output spectrum can be found, at 


least in a formal way, by transforming Eq. (12a). Specifically, invoking the convolu- 
tion theorem, 


Y(f) = a, X(ff)taX*X(f)ta,X*«X*X(f) + > [125] 


Now if x(t) is bandlimited in W, the output of a linear network will contain no fre- 
quencies beyond |f| < W. But in the nonlinear case, we see that the output 
includes X * X(f), which is bandlimited in 2W, X * X * X(f ), which is bandlimited 
in 3W, and so on. The nonlinearities have therefore created output frequency com- 
ponents that were not present in the input. Furthermore, since X + X(f ) may contain 
components for | f | < W, this portion of the spectrum overlaps that of X(f). Using 
filtering techniques, the added components at | f | > W can be removed, but there is 
no convenient way to get rid of the added components at | f| < W. These, in fact, 
constitute the nonlinear distortion. 


y- Tix 





Figure 3.2-9 Transfer characteristic of a nonlinear device. 
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A quantitative measure of nonlinear distortion is provided by taking a simple 
cosine wave, x(t) = cos wot, as the input. Inserting in Eq. (12a) and expanding 


yields 
a 3a 3a 
y(t) = (2.9. =) + (a+ 4 «) COS Wot 


+ (2 Q5 2oyt + 

< t — + e | cos Jai xe 

2 4 : 

Therefore, the nonlinear distortion appears as harmonics of the input wave. The 
amount of second-harmonic distortion is the ratio of the amplitude of this term to 
that of the fundamental, or in percent: 

aa/2 T a4/4 T s 


Second-harmonic distortion — 
ay + 3a3/4 + ose 








| X 100% 


Higher-order harmonics are treated similarly. However, their effect is usually much 
less, and many can be removed entirely by filtering. 

If the input is a sum of two cosine waves, say cos «ft + cos «st, the output will 
include all the harmonics of f, and fọ, plus crossproduct terms which yield f; — fi, 
RA ff 2f; etc. These sum and difference frequencies are designated as inter- 
modulation distortion. Generalizing the intermodulation effect, if x(f) = x,(f) + x(t), 
then y(t) contains the crossproduct x,(‘)x2(f) (and higher-order products, which we 
ignore here). In the frequency domain x, (x(t) becomes X, * X(f); and even though 
X,(f) and X(f) may be separated in frequency, X, * X(f) can overlap both of them, 
producing one form of cross talk. This aspect of nonlinear distortion is of particular 
concern in telephone transmission systems. On the other hand the crossproduct term is 
the desired result when nonlinear devices are used for modulation purposes. 

It is important to note the difference between cross talk and other types of inter- 
ference. Cross talk occurs when one signal crosses over to the frequency band of 
another signal due to nonlinear distortion in the channel. Picking up a conversation 
on a cordless phone or baby monitor occurs because the frequency spectrum allo- 
cated to such devices is too crowded to accommodate all of the users on separate fre- 
quency carriers. Therefore some "sharing" may occur from time to time. While 
cross talk resulting from nonlinear distortion is now rare in telephone transmission 
due to advances in technology, it was a major problem at one time. 

The crossproduct term is the desired result when nonlinear devices are used for 
modulation purposes. In Sect. 4.3 we will examine how nonlinear devices can be 
used to achieve amplitude modulation. In Chap. 5, carefully controlled nonlinear 
distortion again appears in both modulation and detection of FM signals. 

Although nonlinear distortion has no perfect cure, it can be minimized by care- 
ful design. The basic idea is to make sure that the signal does not exceed the linear 
operating range of the channel’s transfer characteristic. Ironically, one strategy 
along this line utilizes two nonlinear signal processors, a compressor at the input 
and an expander at the output, as shown in Fig. 3.2-10. 
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Figure 3.2-10 Companding system. 


A compressor has greater amplification at low signal levels than at high signal 
levels, similar to Fig. 3.2—9, and thereby compresses the range of the input signal. If 
the compressed signal falls within the linear range of the channel, the signal at the 
channel output is proportional to Ti, 4,[x(f)] which is distorted by the compressor but 
not the channel. Ideally, then, the expander has a characteristic that perfectly comple- 
ments the compressor so the expanded output is proportional to Tu (Tissu [x(0]). = 
x(f), as desired. 

The joint use of compressing and expanding is called companding (surprise?) 
and is of particular value in telephone systems. Besides reducing nonlinear distor- 
tion, companding tends to compensate for the signal-level difference between loud 
and soft talkers. Indeed, the latter is the key advantage of companding compared to 
the simpler technique of linearly attenuating the signal at the input (to keep it in the 
linear range of the channel) and linearly amplifying it at the output. 


3.3 TRANSMISSION LOSS AND DECIBELS 


In addition to any signal distortion, a transmission system also reduces the power 
level or “strength” of the output signal. This signal-stréngth reduction is expressed 
in terms of transmission power loss. Although transmission loss can be compensated 
by power amplification, the ever-present electrical noise may prevent successful sig- 
nal recovery in the face of large transmission loss. 

This section describes transmission loss encountered on cable and radio com- 
munication systems. We'll start with a brief review of the more familiar concept of 


power gain, and we'll introduce decibels as a handy measure of power ratios used by 
communication engineers. 


Power Gain 


Let Fig. 3.3-1 represent an LTI system whose input signal has average power P. If 


the system is distortionless, the average signal power at the output will be propor- 
tional to P,,. Thus, the system's power gain is 


g = Paul Pin [1] 


a constant parameter not to be confused with our step-response notation g(t). Sys- 
tems that include amplification may have very large values of g, so we'll find it con- 
venient to express power gain in decibels (dB) defined as 


SaB = 10 logio g [2] 
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Figure 3.3-1 LTI system with power gain g. 


The “B” in dB is capitalized in honor of Alexander Graham Bell who first used 
logarithmic power measurements. 

Since the decibel is a logarithmic unit, it converts powers of 10 to products of 
10. For instance, g = 10" becomes gag = m X 10 dB. Power gain is always posi- 
tive, of course, but negative dB values occur when g x 1.0 — 10° and hence 
gap = 0 dB. Note carefully that 0 dB corresponds to unity gain (g = 1). Given a 
value in dB, the ratio value is 


g = 10/10 [3] 


obtained by inversion of Eq. (2). 
While decibels always represent power ratios, signal power itself may be 
expressed in dB if you divide P by one watt or one milliwatt, as follows: 





P 
Pu 10 logio Ty Paga, = 10 logio [4] 


1 mW 
Rewriting Eq. (1) as (P4,/1 mW) = g(P,,/1 mW) and taking the logarithm of both 
sides then yields the dB equation 


Poutan T Sab uz Pis 


Such manipulations have particular advantages for the more complicated relations 
encountered subsequently, where multiplication and division become addition and 
subtraction of known dB quantities. Communication engineers usually work with 
dBm because the signal powers are quite small at the output of a transmission system. 

Now consider a system described by its transfer function H(f). A sinusoidal 
input with amplitude A, produces the output amplitude A, = |H(f)|A,, and the 
normalized signal powers are P, = A2/2 and P, = A2/2 = |H(f)|’P,. These nor- 
malized powers do not necessarily equal the actual powers in Eq. (1). However, 
when the system has the same impedance level at input and output, the ratio P,/P, 
does equal P,,4,/ P;,. Therefore, if H(f) = Ke 7" then 


g = |HP = K? DI 


In this case, the power gain also applies to energy signals in the sense that E, = gE, 
When the system has unequal input and output impedances, the power (and energy) 
gain is proportional to K?. 

If the system is frequency-selective, Eq. (5) does not hold but | H( f)? still tells 
us how the gain varies as a function of frequency. For a useful measure of frequency 
dependence in terms of signal power we take 
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|H(f)\as = 10 logis | (f£) [6] 


which represents the relative gain in dB. 





EXERCISE 3.3-1 


(a) Verify that Pag, = Pasw + 30 dB. (b) Show that if |H(f)las = —3 dB then 
|H(f)| = 1/2 and |H(f)|? ~ 3. The significance of this result is discussed in the 
section on real filters. 





Transmission Loss and Repeaters 


Any passive transmission medium has power loss rather than gain, since Pou < Pip. 
We therefore prefer to work with the transmission loss, or attenuation 
A 
L= 1/2 = Pol P oui 
Lag = —8ap = 10 logig Pin/ Pour [7] 
Hence, P4, = Pin/L and Pong, = Ping, — Las. 
In the case of transmission lines, coaxial and fiber-optic cables, and wave- 


guides, the output power decreases exponentially with distance. We'll write this 
relation in the form 


P out 7 10 619p in 


where £ is the path length between source and destination and a is the attenuation 
coefficient in dB per unit length. Equation (7) then becomes 








LssqgewUr.  Lp=Ħa&£ [8] 
Table 3.3-1 Typical values of transmission loss 
Transmission Medium Frequency Loss dB/km 
Open-wire pair (0.3 cm diameter) 1 kHz 0.05 
Twisted-wire pair (16 gauge) 10 kHz 2 


100 kHz 
300 kHz 


Coaxial cable (1 cm diameter) 100 kHz 


1 MHz 
3 MHz 


3 
6 
1 
2 
4 
Coaxial cable (15 cm diameter) 100 MHz 1.5 
Rectangular waveguide (5 X 2.5 cm) 10 GHz 5 
Helical waveguide (5 cm diameter) 100 GHz 1. 
2. 
0. 
0 


mm tA 


Fiber-optic cable 3.6 x 10^ Hz 
2.4 X 10! Hz 
1.8 x 10^ Hz 


N U 
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showing that the dB loss is proportional to the length. Table 3.3-1 lists some typical 
values of a for various transmission media and signal frequencies. 

Attenuation values in dB somewhat obscure the dramatic decrease of signal 
power with distance. To bring out the implications of Eq. (8) more clearly, suppose 
you transmit a signal on a 30 km length of cable having a = 3 dB/km. Then 
Lg = 3 X30 = 90 dB, L = 10°, and P u = 107? P,. Doubling the path length 
doubles the attenuation to 180 dB, so that L = 103? and Pou = 10718 Pip This loss is 
so great that you'd need an input power of one megawatt (10$ W) to get an output 
power of one picowatt (10^? W)! 

Large attenuation certainly calls for amplification to boost the output signal. As 
an example, Fig. 3.3—2 represents a cable transmission system with an output ampli- 
fier and a repeater amplifier inserted near the middle of the path. (Any preamplifi- 
cation at the input would be absorbed in the value of Pia) Since power gains multi- 
ply in a cascade connection like this, 





OQ. 
Bass P; [9a] 


P ut 7 Pa m in 
out = (£182 83 84) Likes 


which becomes the dB equation 
Pow = (82 + 84) — (Li + Ls) + Pin [9b] 


We’ve dropped the dB subscripts here for simplicity, but the addition and subtrac- 
tion in Eq. (9b) unambiguously identifies it as a dB equation. Of course, the units of 
Pou (dBW or dBm) will be the same as those of Pin- 

The repeater in Fig. 3.3—2 has been placed near the middle of the path to pre- 
vent the signal power from dropping down into the noise level of the amplifier. 
Long-haul cable systems have repeaters spaced every few kilometers for this reason, 
and a transcontinental telephone link might include more than 2000 repeaters. The 
signal-power analysis of such systems follows the same lines as Eq. (9). The noise 
analysis is presented in the Appendix. 


Fiber Optics 


Optical communication systems have become increasingly popular over the last two 
decades with advances in laser and fiber-optic technologies. Because optical sys- 
tems use carrier frequencies in the range of 2 X 10^ Hz, the transmitted signals can 
have much larger bandwidth than is possible with metal cables such as twisted-wire 


Cable Repeater Cable Output 
section amplifier section amplifier 





Figure 3.3-2 Cable transmission system with a repeater amplifier. 





| 


3.3 Transmission Loss and Decibels 


pair and coaxial cable. We will see in the next chapter that the theoretical maximum 
bandwidth for that carrier frequency is on the order of 2 X 10P Hz! While we may 
never need that much bandwidth, it is nice to have extra if we need it. 

In the 1960s fiber-optic cables were extremely lossy, with losses around 
1000 dB/km, and were impractical for commercial use. Today these losses are on 
the order of 0.2 to 2 dB/km depending on the type of fiber used and the wavelength 
of the signal. This is lower than most twisted-wire pair and coaxial cable systems. 
There are many advantages to using fiber-optic channels in addition to large band- 
width and low loss. The dielectric waveguide property of the optical fiber makes it 
less susceptible to interference from external sources. Since the transmitted signal is 
light rather than current, there is no electromagnetic field to generate cross talk and 
no radiated RF energy to interfere with other communication systems. In addition, 
since moving photons do not interact, there is no noise generated inside the optical 
fiber. Fiber-optic channels are safer to install and maintain since there is no large 
current or voltage to worry about. Furthermore, since it is virtually impossible to tap 
into a fiber-optic channel without the user detecting it, they are secure enough for 
military applications. They are rugged and flexible, and operate over a larger tem- 
perature variation than metal cable. The small size (about the diameter of a human 
hair) and weight mean they take up less storage space and are cheaper to transport. 
Finally, they are fabricated from sand, which is a plentiful resource. While the up- 
front installation costs are higher, it is predicted that the long-term costs will ulti- 
mately be lower than with metal-based cables. 

Most optical communication systems are digital since system limitations on the 
amplitude of analog modulation make it impractical. The system is a hybrid of elec- 
trical and optical components, since the signal sources and final receivers are still 
made up of electronics. Optical transmitters use either LEDs or solid-state lasers to 
generate light pulses. The choice between these two is driven by design constraints. 
LEDs, which produce noncoherent (multiple wavelengths) light, are rugged, inex- 
pensive, and have low power output (—0.5 mW). Lasers are much higher in cost and 
have a shorter lifetime; however they produce coherent (single wavelength) light 
and have a power output of around 5 mW. The receivers are usually PIN diodes or 
avalanche photodiodes (APD), depending on the wavelength of the transmitted sig- 
nal. In the remainder of this discussion we will concentrate our attention on the 
fiber-optic channel itself. 

Fiber-optic cables have a core made of pure silica glass surrounded by a 
cladding layer also usually made of silica glass, but sometimes made of plastic. 
There is an outer, thin protective jacket made of plastic in most cases. In the core the 
signal traverses the fiber. The cladding reduces losses by keeping the signal power 
within the core. There are three main types of fiber-optic cable: single-mode fibers, 
multimode step-index fibers, and multimode graded-index fibers. Figure 3.3—3a 
shows three light rays traversing a single-mode fiber. Because the diameter of the 
core is sufficiently small (~8 um), there is only a single path for each of the rays to 
follow as they propagate down the length of the fiber. The difference in the index of 
refraction between the core and cladding layers causes the light to be reflected back 
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Figure 3.3-3 {a} Light propagation down a single-mode step-index fiber. {b} Light propaga- 
tion down a multimode step-index fiber. [c) Light propagation down a multi- 
mode graded-index fiber. 


into the channel, and thus the rays follow a straight path through the fiber. Conse- 
quently, each ray of light travels the same distance in a given period of time, and a 
pulse input would have essentially the same shape at the output. Therefore single- 
mode fibers have the capacity for large transmission bandwidths, which makes them 
very popular for commercial applications. However, the small core diameter makes 
it difficult to align cable section boundaries and to couple the source to the fiber, and 
thus losses can occur. 

Multimode fibers allow multiple paths through the cable. Because they have a 
larger core diameter (—50 um) it is easier to splice and couple the fiber segments, 
resulting in less loss. In addition, more light rays at differing angles can enter the 
channel. In a multimode step-index fiber there is a step change between the index of 
refraction of the core and cladding, as there is with single-mode fibers. Fig. 3.3-3b 
shows three rays entering a multimode step-index fiber at various angles. It is clear 
that the paths of the rays will be quite different. Ray 1 travels straight through as in 
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the case of the single-mode fiber. Ray 2 is reflected off of the core-cladding boun- 
dary a few times and thus takes a longer path through the cable. Ray 3, with multi- 
ple reflections, has a much longer path. As Fig. 3.3—3b shows, the angle of incidence 
impacts the time to reach the receiver. We can define two terms to describe this 
channel delay. The average time difference between the arrivals of the various rays 
is termed mean-time delay, and the standard deviation is called the delay spread. 
The impact on a narrow pulse would be to broaden the pulse width as the signal 
propagates down the channel. If the broadening exceeds the gap between the pulses, 
overlap may result and the pulses will not be distinguishable at the output. Therefore 
the maximum bandwidth of the transmitted signal in a multimode step-index chan- 
nel is much lower than in the single mode case. 

Multimode graded-index fibers give us the best of both worlds in performance. 
The large central core has an index of refraction that is not uniform. The refractive 
index is greatest at the center and tapers gradually toward the outer edge. As shown 
in Fig. 3.3—3c, the rays again propagate along multiple paths; however since they are 
constantly refracted there is a continuous bending of the light rays. The velocity of 
the wave is inversely proportional to the refractive index so that those waves farthest 
from the center propagate fastest. The refractive index profile can be designed so 
that all of the waves have approximately the same delay when they reach the output. 
Therefore the lower dispersion permits higher transmission bandwidth. While the 
bandwidth of a multimode graded-index fiber is lower than that of a single-mode 
fiber, the benefits of the larger core diameter are sufficient to make it suitable for 
long-distance communication applications. 

With all of the fiber types there are several places where losses occur, including 
where the fiber meets the transmitter or receiver, where the fiber sections connect to 
each other, and within the fiber itself. Attenuation within the fiber results primarily 
from absorption losses due to impurities in the silica glass, and scattering losses 
due to imperfections in the waveguide. Losses increase exponentially with distance 
traversed and also vary with wavelength. There are three wavelength regions where 
there are relative minima in the attenuation curve, and they are given in Table 3.3-1. 
The smallest amount of loss occurs around 1300 and 1500 nm, so those frequencies 
are used most often for long-distance communication systems. 

Current commercial applications require repeaters approximately every 40 km. 
However, each year brings technology advances, so this spacing continues to 
increase. Conventional repeater amplifiers convert the light wave to an electrical sig- 
nal, amplify it, and convert it back to an optical signal for retransmission. However, 
light wave amplifiers are being developed and may be available soon. 

Fiber-optic communication systems are quickly becoming the standard for 
long-distance telecommunications. Homes and businesses are increasingly wired 
internal and externally with optical fibers. Long-distance telephone companies 
advertise the clear, quiet channels with claims that listeners can hear a pin drop. 
Underwater fiber cables now cover more than two-thirds of the world's circumfer- 
ence and can handle over 100,000 telephone conversations at one time. Compare 
that to the first transoceanic cable that was a technological breakthrough in 1956 and 
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carried just 36 voice channels. While current systems can handle 90 Mbits/sec to 
2.5 Gbits/sec, there have been experimental results as high as 1000 Gbits/sec. At 
current transmission rates of 64 kbits/sec, this represents 15 million telephone con- 
versations over a single optical fiber. As capacity continues to expand, we will no 
doubt find new ways to fill it. 


Radio Transmission * 


Signal transmission by radiowave propagation can reduce the required number of 
repeaters, and has the additional advantage of eliminating long cables. Although 
radio involves modulation processes described in later chapters, it seems appropriate 
here to examine the transmission loss for line-of-sight propagation illustrated in 
Fig. 3.3-4 where the radio wave travels a direct path from transmitting to receiving 
antenna. This propagation mode is commonly employed for long-distance commu- 
nication at frequencies above about 100 MHz. 

The free-space loss on a line-of-sight path is due to spherical dispersion of the 
radio wave. This loss is given by 


2 2 
L= (=) = (SE) [10a] 
À c 


in which A is the wavelength, f the signal frequency, and c the speed of light. If we 
express £ in kilometers and f in gigahertz (10? Hz), Eq. (10a) becomes 


Lap = 92.4 + 20 logio for: + 20 logio Cin [105] 


We see that Lap increases as the logarithm of £, rather than in direct proportion to path 
length. Thus, for instance, doubling the path length increases the loss by only 6 dB. 

Furthermore, directional antennas have a focusing effect that acts like amplifi- 
cation in the sense that 


ETER 
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Pow = Tu 01} 








Figure 3.3-4 Line-oFsight radio transmission. 
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where gr and gp represent the antenna gains at the transmitter and receiver. The 
maximum transmitting or receiving gain of an antenna with effective aperture area 
A, is 

2 
g= ma: = uE [12] 
À c 
where c ~ 3 X 10? km/s. The value of A, for a horn or dish antenna approximately 
equals its physical area, and large parabolic dishes may provide gains in excess of 
60 dB. 

Commercial radio stations often use compression to produce a transmitted sig- 
nal that has higher power but doesn't exceed the system's linear operating region. As 
mentioned in Sect. 3.2, compression provides greater amplification of low-level sig- 
nals, and can raise them above the background noise level. However since your 
home radio does not have a built-in expander to complete the companding process, 
some audible distortion may be present. To cope with this, music production com- 
panies often preprocess the materials sent to radio stations to ensure the integrity of 
the desired sound. 

Satellites employ line-of-sight radio transmission over very long distances. 
They have a broad coverage area and can reach areas that are not covered by cable 
or fiber, including mobile platforms such as ships and planes. Even though fiber- 
optic systems are carrying an increasing amount of transoceanic telephone traffic 
(and may make satellites obsolete for many applications), satellite relays still han- 
dle the bulk of very long distance telecommunications. Satellite relays also make it 
possible to transmit TV signals across the ocean. They have a wide bandwidth of 
about 500 MHz that can be subdivided for use by individual transponders. Most 
satellites are in geostationary orbit. This means that they are synchronous with 
Earth's rotation and are located directly above the equator, and thus they appear 
stationary in the sky. The main advantage is that antennas on Earth pointing at the 
satellite can be fixed. 

A typical C band satellite has an uplink frequency of 6 GHz, a downlink fre- 
quency of 4 GHz, and 12 transponders each having a bandwidth of 36 MHz. The 
advantages in using this frequency range are that it allows use of relatively inexpen- 
sive microwave equipment, has low attenuation due to rainfall (the primary atmo- 
spheric cause of signal loss), and has a low sky background noise. However, there 
can be severe interference from terrestrial microwave systems, so many satellites 
now use the Ku band. The Ku band frequencies are 14 GHz for uplink and 12 GHz 
for downlink. This allows smaller and less expensive antennas. C band satellites are 
most commonly used for commercial cable TV systems, whereas Ku band is used 
for videoconferencing. A newer service that allows direct broadcast satellites (DBS) 
for home television service uses 17 GHz for uplink and 12 GHz for downlink. 

By their nature, satellites require multiple users to access them from different 
locations at the same time. A variety of multiple access techniques have been devel- 
oped, and will be discussed further in a later chapter. Personal communication 
devices such as cellular phones rely on multiple access techniques such as time 
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division multiple access (TDMA) and code division multiple access (CDMA). Prop- 
agation delay can be a problem over long distances for voice communication, and 
may require echo cancellation in the channel. 

Current technology allows portable satellite uplink systems to travel to where 
news or an event is happening. In fact, all equipment can fit in a van or in several 
large trunks that can be shipped on an airplane. For a more complete but practical 
overview of satellites, see Tomasi (1994, Chap. 18). 


Satellite Relay System 


Figure 3.3-5 shows a simplified transoceanic television system with a satellite relay 
serving as a repeater. The satellite is in geostationary orbit and is about 22,300 miles 
(36,000 km) above the equator. The uplink frequency is 6 GHz, and the downlink 
frequency is 4 GHz. Equation (10b) gives an uplink path loss 


L, = 92.4 + 20 logio 6 + 20 logio 3.6 X 10^ = 199.1 dB 
and a downlink loss 
La = 92.4 + 20 logy 4 + 20 log, 93.6 X 104 = 195.6 dB 


since the distance from the transmitter and receiver towers to the satellite is approx- 
imately the same as the distance from Earth to the satellite. The antenna gains in dB 
are given on the drawing with subscripts identifying the various functions—for 
example, ggy stands for the receiving antenna gain on the uplink from ground to 
satellite. The satellite has a repeater amplifier that produces a typical output of 
18 dBW. If the transmitter input power is 35 dBW, the power received at the satellite 
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Figure 3.3-5 Satellite relay system. 
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is 35 dBW + 55 dB — 199.1 dB + 20dB = —144.1 dBW. The power output at 


the receiver is 18 dBW + 16dB — 195.6 dB + 51 dB = —110.6 dBW. Inverting 
Eq. (4) gives 
Po = 100060 X 1 W = 8.7 X 1072W 


Such minute power levels are typical for satellite systems. 





A 40 km cable system has Pin = 2 W and a repeater with 64 dB gain is inserted EXERCISE 3.3-2 
24 km from the input. The cable sections have œ = 2.5 dB/km. Use dB equations to 
find the signal power at: (a) the repeater's input; (b) the final output. 








3.4 FILTERS AND FILTERING 


Virtually every communication system includes one or more filters for the purpose 
of separating an information-bearing signal from unwanted contaminations such as 
interference, noise, and distortion products. In this section we'll define ideal filters, 
describe the differences between real and ideal filters, and examine the effect of fil- 
tering on pulsed signals. 


Ideal Filters 


By definition, an ideal filter has the characteristics of distortionless transmission 
over one or more specified frequency bands and zero response at all other frequen- 
cies. In particular, the transfer function of an ideal bandpass filter (BPF) is 


Med c" Sc 


1 
0 otherwise ul 


as plotted in Fig. 3.4-1. The parameters f? and f, are the lower and upper cutoff fre- 


quencies, respectively, since they mark the end points of the passband. The filter's 
bandwidth is 


B sade (eae 


which we measure in terms of the positive-frequency portion of the passband. 

In similar fashion, an ideal lowpass filter (LPF) is defined by Eq. (1) with 
fe = 0, so B = f, while an ideal highpass filter (HPF) has f? > O and f, = co. 
Ideal band-rejection or notch filters provide distortionless transmission over all 
frequencies except some stopband, say f; = |f| = fı where H(f) = 0. 

But all such filters are physically unrealizable in the sense that their character- 
istics cannot be achieved with a finite number of elements. We'll skip the general 
proof of this assertion. Instead, we'll give an instructive plausibility argument based 
on the impulse response. 
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Figure 3.4-1 Transfer function of an ideal bandpass filter. 


HCf) 
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Figure 3.4-2 Ideal lowpass filter. (a) Transfer function; [b] impulse response. 


Consider an ideal LPF whose transfer function, shown in Fig. 3.4-2a, can be 
written as 


H(f) = Ke qT] (£) [2a] 


Its impulse response will be 
h(t) = F'[H(f)] = 2BK sinc 2B(t — tg) [2b] 


which is sketched in Fig. 3.4—2b. Since h(t) is the response to 6(t) and A(t) has 
nonzero values for t < 0, the output appears before the input is applied. Such a fil- 
ter is said to be anticipatory or noncausal, and the portion of the output appearing 
before the input is called a precursor. Without doubt, such behavior is physically 
impossible, and hence the filter must be unrealizable. Like results hold for the ideal 
BPF and HPF. 
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Fictitious though they may be, ideal filters have great conceptual value in the 


study of communication systems. Furthermore, many real filters come quite close to 
ideal behavior. 





Show that the impulse response of an ideal BPF is 
h(t) = 2BK sinc B(t — tg) cos w,(t — tg) 
where w, = 7(fe + fa). 





EXERCISE 3.4-1 





Bandlimiting and Timelimiting 


Earlier we said that a signal u(¢) is bandlimited if there exists some constant W such 
that 


Wf)-0 |fl>W 


Hence, the spectrum has no content outside | f| > W. Similarly, a timelimited sig- 
nal is defined by the property that, for the constants t; < t5, 


u(t) = 0 t « t, andt 7 t, 


Hence, the signal "starts" at t = t, and "ends" at t = t;. Let's further examine these 
two definitions in the light of real versus ideal filters. 

The concepts of ideal filtering and bandlimited signals go hand in hand, since 
applying a signal to an ideal LPF produces a bandlimited signal at the output. We've 
also seen that the impulse response of an ideal LPF is a sinc pulse lasting for all 
time. We now assert that any signal emerging from an ideal LPF will exist for all 
time. Consequently, a strictly bandlimited signal cannot be timelimited. Conversely, 
by duality, a strictly timelimited signal cannot be bandlimited. Every transform pair 
we've encountered supports these assertions, and a general proof is given in Wozen- 
craft and Jacobs (1965, App. 5B). Thus, 





This observation raises concerns about the signal and filter models used in the 
study of communication systems. Since a signal cannot be both bandlimited and 
timelimited, we should either abandon bandlimited signals (and ideal filters) or else 
we must accept signal models that exist for all time. On the one hand, we recognize 
that any real signal is timelimited, having starting and ending times. On the other 
hand, the concepts of bandlimited spectra and ideal filters are too useful and appeal- 
ing to be dismissed entirely. 
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The resolution of our dilemma is really not so difficult, requiring but a small 
compromise. Although a strictly timelimited signal is not strictly bandlimited, its 
spectrum may be negligibly small above some upper frequency limit W. Likewise, a 
strictly bandlimited signal may be negligibly small outside a certain time interval 
I X f X t$. Therefore, we will often assume that signals are essentially both band- 
limited and timelimited for most practical purposes. 


Real Filters 


The design of realizable filters that approach ideal behavior is an advanced topic 
outside the scope of this book. But we should at least look at the major differences 
between real and ideal filters to gain some understanding of the approximations 
implied by the assumption of an ideal filter. Further information on filter design and 
implementation can be found in texts such as Van Valkenburg (1982). 

To begin our discussion, Fig. 3.4—3 shows the amplitude ratio of a typical real 
bandpass filter. Compared with the ideal BPF in Fig. 3.4—1, we see a passband 
where |H(f )| is relatively large (but not constant) and stopbands where |H(f)| is 
quite small (but not zero). The end points of the passband are usually defined by 


Inf)! = E IB) Inox = E Werne t5 


so that |H(f)|* falls no lower than K?/2 for fe = |f| =f, The bandwidth 
B =f, — fe is then called the half-power or 3 dB bandwidth. Similarly, the end 
points of the stopbands can be taken where | H(f)| drops to a suitably small value 
such as K/10 or K/100. 

Between the passband and stopbands are transition regions, shown shaded, 
where the filter neither “passes” nor “rejects” frequency components. Therefore, 
effective signal filtering often depends on having a filter with very narrow transition 
regions. We'll pursue this aspect by examining one particular class of filters in some 
detail. Then we'll describe other popular designs. 

The simplest of the standard filter types is the nth-order Butterworth LPF, 
whose circuit contains n reactive elements (capacitors and inductors). The transfer 
function with K = 1 has the form 


Kf) 









Stopband Passband Stopband 


: P : 


Figure 3.4-3 Typical amplitude ratio of a real bandpass filter. 
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»" ] 
— P.f/B) 


where B equals the 3 dB bandwidth and P,( jf/B) is a complex polynomial. The 
family of Butterworth polynomials is defined by the property 


|P.Gf/B)F = 1 + (f/BY" 


A(f) [4a] 


so that 
1 


|H(F)| "MEUS 


Consequently, the first n derivatives of |H(f)| equal zero at f = 0 and we say that 
|H(f )| is maximally flat. Table 3.4-1 lists the Butterworth polynomials for n = 1 
through 4, using the normalized variable p = jf/ B. 

A first-order Butterworth filter has the same characteristics as an RC lowpass 
filter and would be a poor approximation of an ideal LPF. But the approximation 
improves as you increase n by adding more elements to the circuit. For instance, 
the impulse response of a third-order filter sketched in Fig. 3.4—4a bears obvious 
resemblance to that of an ideal LPF—without the precursors, of course. The 
frequency-response curves of this filter are plotted in Fig. 3.4-4b. Note that the 
phase shift has a reasonably linear slope over the passband, implying time delay 
plus some delay distortion. 


[4b] 


A(t) 






CFI 


arg Hf) 





la) (b) 


Figure 3.4-4 Third-order Butterworth LPF. (a) Impulse response; (b) transfer function. 
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Table 3.4-1 Butterworth polynomials 

n P,{P) 

1 1+p 

2 1+ Vip +p? 

3 (1 + p + pp) 

4 (1 + 0.765p + p?)(1 + 1.848p + p?) 


A clearer picture of the amplitude ratio in the transition region is obtained from a 
Bode diagram, constructed by plotting | H(f ) | in dB versus f on a logarithmic scale. 
Figure 3.4—5 shows the Bode diagram for Butterworth lowpass filters with various val- 
ues of n. If we define the edge of the stopband at |H(f)| = —20 dB, the width of the 
transition region when n = 1 is 10B — B = 9B but only 1.25B — B = 0.25B when 
n = 10. Clearly, |H(f)| approaches the ideal square characteristic in the limit as 
n — oo. At the same time, however, the slope of the phase shift (not shown) increases 
with n and the delay distortion may become intolerably large. 

In situations where potential delay distortion is a major concern, a Bessel- 
'Thomson filter would be the preferred choice. This class of filters 1s characterized 
by maximally linear phase shift for a given value of n, but has a wider transition 
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Figure 3.4-5 Bode diagram for Butterworth LPFs. 
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region. At the other extreme, the class of equiripple filters (including Chebyshev 
and elliptic filters) provides the sharpest transition for a given value of n; but these 
filters have small amplitude ripples in the passband and significantly nonlinear 
phase shift. Equiripple filters would be satisfactory in audio applications, for 
instance, whereas pulse applications might call for the superior transient perfor- 
mance of Bessel-Thomson filters. 

All three filter classes can be implemented with active devices (such as opera- 
tional amplifiers) that eliminate the need for bulky inductors. Switched-capacitor 
filter designs go even further and eliminate resistors that would take up too much 
space in a large-scale integrated circuit. All three classes can also be modified to 
obtain highpass or bandpass filters. However, some practical implementation prob- 
lems do arise when you want a bandpass filter with a narrow but reasonably square 
passband. Special designs that employ electromechanical phenomena have been 
developed for such applications. For example, Fig. 3.4-6 shows the amplitude ratio 
of a seventh-order monolithic crystal BPF intended for use in an AM radio. 


1.0 
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Figure 3.4-6 Amplitude ratio of a mechanical filter. 
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x(t) R C y(t) 


Figure 3.4-7 Second-order Butterworth LPF. 


We can obtain an expression for the transfer function as 


Zrce 
H = —— 
(f) Zgc + joL 
where 
R/jwC R 
Zi Sa d T ue 
R+1/joC 1+ jwRC 
Thus 
1 
Mf) = 


1 + jeL/R — w*LC 
-ł 


From Table 3.4—1 with p = jf/B, we want 


wor [rove 0T 


B 


The required relationship between R, L, and C that satisfies the equation can be 
found by setting 


ok VÀ. Viae VIC 


R 


which yields R = 4/—— 





EXERCISE 3.4-2 Show that a Butterworth LPF has |H(f)|gs ^ —20nlog;9(//B) when f > B. 
Then find the minimum value of n needed so that |H(f)| = 1/10 for f = 2B. 





Pulse Response and Risetime 


A rectangular pulse, or any other signal with an abrupt transition, contains signifi- 
cant high-frequency components that will be attenuated or eliminated by a lowpass 
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filter. Pulse filtering therefore produces a smoothing or smearing effect that must be 
studied in the time domain. The study of pulse response undertaken here leads to 
useful information about pulse transmission systems. 

Let's begin with the unit step input signal x(t) = u(t), which could represent 
the leading edge of a rectangular pulse. In terms of the filter's impulse response A(t), 
the step response will be 


g(t) £ | i h(A)u(t — A) dà = | h(a) da [5] 


—co —oo 


since u(t — A) = 0 for A > t. We saw in Examples 3.1—1 and 3.1-2 for instance, 
that a first-order lowpass filter has 


g(t) = (1 — e") 
where B is the 3 dB bandwidth. 
Of course a first-order LPF doesn’t severely restrict high-frequency transmis- 


sion. So let’s go to the extreme case of an ideal LPF, taking unit gain and zero time 
delay for simplicity. From Eq. (2b) we have h(t) = 2B sinc 2Bt and Eq. (5) becomes 


t 
g(t) = | 2B sinc 2BA dA 
0 2Bt 
-| sine jd. | sinc u du 
—oo 0 
where u = 2BÀ. The first integral is known to equal 1/2, but the second requires 


numerical evaluation. Fortunately, the result can be expressed in terms of the tabu- 
Jated sine integral function 





9 . 0/v 
Si (0) 2 | = da = a sinc u du [61 
0 0 


which is plotted in Fig. 3.4-8 for 0 > 0 and approaches the value 77/2 as 0 — co. 
The function is also defined for 0 < 0 by virtue of the odd-symmetry property 
Si(—0) = —Si(0). Using Eq. (6) in the problem at hand we get 


l 
+ — Si (2rBt) [7] 
T 


N|= 


g(t) = 


obtained by setting 0/ar = 2Bt. 

For comparison purposes, Fig. 3.4-9 shows the step response of an ideal LPF 
along with that of a first-order LPF. The ideal LPF completely removes all high fre- 
quencies |f| > B, producing precursors, overshoot, and oscillations in the step 
response. (This behavior is the same as Gibbs’s phenomenon illustrated in Fig. 
2.1-10 and in Example 2.4-2.) None of these effects appears in the response of the 
first-order LPF, which gradually attenuates but does not eliminate high frequencies. 
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Figure 3.4-8 The sine integral function. 
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Figure 3.4-9 Step response of ideal and firstorder LPFs. 


The step response of a more selective filter—a third-order Butterworth LPF, for exam- 
ple—would more nearly resemble a time-delayed version of the ideal LPF response. 

Before moving on to pulse response per se, there’s an important conclusion to be 
drawn from Fig. 3.4—9 regarding risetime. Risetime is a measure of the "speed" of a 
step response, usually defined as the time interval t, between g(t) = 0.1 and 
g(t) = 0.9 and known as the 10-90% risetime. The risetime of a first-order lowpass 
filter can be computed from g(t) as t, = 0.35/B, while the ideal filter has t, ^ 0.44/B. 
Both values are reasonably close to 0.5/B so we'll use the approximation 


1 
ipee [8] 
” 2B 
for the risetime of an arbitrary lowpass filter with bandwidth B. 
Our work with step response pays off immediately in the calculation of pulse 
response if we take the input signal to be a unit-height rectangular pulse with dura- 
tion T starting at t = 0. Then we can write 
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Figure 3.4-10 Pulse response of an ideal LPF. 


x(t) = u(t) — u(t — T) 


and hence 


y(t) = g(t) — e(t — 7) 
which follows from superposition. 
Using g(t) from Eq. (7), we obtain the pulse response of an ideal LPF as 


y(e) = = {Si Qui) — Si [2mB(e ~ 7)]) (91 


which is plotted in Fig. 3.4—10 for three values of the product Br. The response has a 
more-or-less rectangular shape when Br = 2, whereas it becomes badly smeared and 
spread out if Br < 1. The intermediate case Br = 1 gives a recognizable but not rec- 
tangular output pulse. The same conclusions can be drawn from the pulse response of 
a first-order lowpass filter previously sketched in Fig. 3.1—3, and similar results would 
hold for other input pulse shapes and other lowpass filter characteristics. 

Now we’re in a position to make some general statements about bandwidth 


requirements for pulse transmission. Reproducing the actual pulse shape requires a 
large bandwidth, say 


1 


gy 2 
mun 





B 


where Tmin represents the smallest output pulse duration. But if we only need to 
detect that a pulse has been sent, or perhaps measure the pulse amplitude, we can get 
by with the smaller bandwidth 
l 
2T 


Bz 





[10] 


min 
an important and handy rule of thumb. 

Equation (10) also gives the condition for distinguishing between, or resolving, 
output pulses spaced by Tmin or more. Figure 3.4—11 shows the resolution condition 
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Figure 3.4-11 Pulse resolution of an ideal LPF. B = 1/27. 





for an ideal lowpass channel with B = 3r. A smaller bandwidth or smaller spacing 
would result in considerable overlap, making it difficult to identify separate pulses. 
Besides pulse detection and resolution, we’ll occasionally be concerned with 
pulse position measured relative to some reference time. Such measurements have 
inherent ambiguity due to the rounded output pulse shape and nonzero risetime of 


leading and trailing edges. For a specified minimum risetime, Eq. (8) yields the 
bandwidth requirement 


Bea 





2r n 


another handy rule of thumb. 

Throughout the foregoing discussion we've tacitly assumed that the transmis- 
sion channel has satisfactory phase-shift characteristics. If not, the resulting delay 
distortion could render the channel useless for pulse transmission, regardless of the 
bandwidth. Therefore, our bandwidth requirements in Eqs. (10) and (11) imply the 
additional stipulation of nearly linear phase shift over | f| = B. A phase equaliza- 
tion network may be needed to achieve this condition. 





A certain signal consists of pulses whose durations range from 10 to 25 us; the pulses 
occur at random times, but a given pulse always starts at least 30 ws after the starting 
. time of the previous pulse. Find the minimum transmission bandwidth required for 
pulse detection and resolution, and estimate the resulting risetime at the output. 


EXERCISE 3.4-3 





3.5 QUADRATURE FILTERS AND HILBERT 
TRANSFORMS 


The Fourier transform serves most of our needs in the study of filtered signals since, 
in most cases, we are interested in the separation of signals based on their frequency 
content. However, there are times when separating signals on the basis of phase is 
more convenient. For these applications we'll use the Hilbert transform, which we’ ll 
introduce in conjunction with quadrature filtering. In Chap. 4 we will make use of 
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Figure 3.5-1 Transfer function of a quadrature phase shifter. 


the Hilbert transform in the study of two important applications: the generation of 
single-sideband amplitude modulation and the mathematical representation of band- 
pass signals. 

A quadrature filter is an allpass network that merely shifts the phase of posi- 
tive frequency components by —90° and negative frequency components by +90°. 
Since a +90° phase shift is equivalent to multiplying by e^/?" = +j, the transfer 
function can be written in terms of the signum function as 


-j fug 
H =- = 1 
of) j sgn f s f«0 [1a] 
which is plotted in Fig. 3.5—1. The corresponding impulse response is 
ho(t) = ES [15] 
x qt 


We obtain this result by applying duality to F[sgn t] = 1/jaf which yields 
S [l/jmt] = sgn(—f) = —sgnf,so 9 ![—jsgnf] = j/jvt = Mt. 
Now let an arbitrary signal x(f) be the input to a quadrature filter. The output 


signal y(t) = x(t) * ho(t) will be defined as the Hilbert transform of x(t), denoted 
by x(t). Thus 


Ed À 
: | 09 5, [21 


Apu SA 1 
xit) = x(t) x — = — 
( ) ( ) TÉ T t—À 
~oo 
Note that Hilbert transformation is a convolution and does not change the domain, 


so both x(t) and X(t) are functions of time. Even so, we can easily write the spectrum 
of x(t), namely 


F[x(t)] = (~j sgnf )X(f) [3] 
since phase shifting produces the output spectrum He f )X( f ). 

The catalog of Hilbert transform pairs is quite short compared to our Fourier trans- 
form catalog, and the Hilbert transform does not even exist for many common signal 
models. Mathematically, the trouble comes from potential singularities in Eq. (2) when 
À = tand the integrand becomes undefined. Physically, we see from Eq. (15) that Ao(t) 
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is noncausal, which means that the quadrature filter is unrealizable—although its 
behavior can be approximated over a finite frequency band using a real network. 

Although the Hilbert transform operates exclusively in the time domain, it has a 
number of useful properties. Those applicable to our interests are discussed here. In 
all cases we will assume that the signal x() is real. 


1. A signal x(t) and its Hilbert transform x(t) have the same amplitude spectrum. 
In addition, the energy or power in a signal and its Hilbert transform are also 
equal. These follow directly from Eq. (3) since |—j sgn f| = 1 for all f. 


2. If x(t) is the Hilbert transform of x(£), then —x(t) is the Hilbert transform of 
x(t). The details of proving this property are left as an exercise; however, it fol- 
lows that two successive shifts of 90° result in a total shift of 180°. 


3. A signal x(f) and its Hilbert transform x(t) are orthogonal. In Sect. 3.6 we will 
show that this means 


| x(t)x(t) dt = 0 for energy signals 


-00 





and 
p 
lim — | x(t)x(t) dt = O for power signals 
Too 2T -T 
EXAMPLE 3.5-1 Hilbert Transform of a Cosine Signal 


The simplest and most obvious Hilbert transform pair follows directly from the 
phase-shift property of the quadrature filter. Specifically, if the input is 

x(t) = A cos (wot + Q) 
then 


Au) = isen SX) = E- f) + AF f] sens 


A 
= 2j LEC — fo) + &(f + fo)] 
and thus x(t) = A sin (wot + ¢). 
This transform pair can be used to find the Hilbert transform of any signal that 
consists of a sum of sinusoids. However, most other Hilbert transforms involve per- 
forming the convolution operation in Eq. (2), as illustrated by the following example. 





EXAMPLE 3.5-2 ga Hilbert Transform of a Rectangular Pulse 


Consider the delayed rectangular pulse x(t) = A[u(t) — u(t — 7)]. The Hilbert 
transform is 





| 


Maius 
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(b) 


Figure 3.5-2 Hilbert transform of a rectangular pulse. (a) Convolution; {b} result. 





A Af 1 
x(t) = dx 
7 0 t—A 


whose evaluation requires graphical interpretation. Figure 3.5-2a shows the case 
O < t < 7/2 and we see that the areas cancel out between A = O and A = 2t, leaving 
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This result also holds for 7/2 <t< v, when the areas cancel out between 
A = 2t — t and A = r. There is no area cancellation fort < O or t > 7, and 


sre dn nos t 
n T)hi-A m” t=T 


These separate cases can be combined in one expression 














A t 


Ji 


X(t) = 


which is plotted in Fig. 3.5—2b along with x(t). 


The infinite spikes in X(t) att = 0 and t = 7 can be viewed as an extreme man- 
ifestation of delay distortion. See Fig. 3.2—5 for comparison. 





In [4] 








t=T 





The inverse Hilbert transform recovers x(t) from x(t). Use spectral analysis to show 
that x(t) * (—1/at) = x(t). 


EXERCISE 3.5-1 
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3.6 CORRELATION AND SPECTRAL DENSITY 


This section introduces correlation functions as another approach to signal and sys- 
tem analysis. Correlation focuses on time averages and signal power or energy. Taking 
the Founer transform of a correlation function leads to frequency-domain representa- 
ton in terms of spectral density functions, equivalent to energy spectral density in 
the case of an energy signal. In the case of a power signal, the spectral density function 
tells us the power distribution over frequency. 

But the signals themselves need not be Fourier transformable. Hence, spectral 
density allows us to deal with a broader range of signal models, including the impor- 
tant class of random signals. We develop correlation and spectral density here as 
analytic tools for nonrandom signals. You should then feel more comfortable with 
them when we get to random signals in Chap. 9. 





Correlation of Power Signals 


Let v(t) be a power signal, but not necessarily real nor periodic. Our only stipulation 
is that it must have well-defined average power 


P, S (WOD = (were) = 0 (1] 


The time-averaging operation here is interpreted in the general form 
_ q [72 
(2(t)) = im. | z(t) dt 
-T/2 


where z(f) is an arbitrary time function. For reference purposes, we note that this 
operation has the following properties: 


(z*(t)) = (2())* [2a] 
(z(t — 14)) = (z(t)) any ta [2b] 
(ayzi(t) + azza(t)) = aiGi() + az(z2(t)) [2d 


We'll have frequent use for these properties in conjunction with correlation. 

If v(t) and w(2) are power signals, the average (u(t)w*(t)) is called the scalar 
product of v(t) and w(t). The scalar product is a number, possibly complex, that 
serves as a measure of similarity between the two signals. Schwarz's inequality 
relates the scalar product to the signal powers P, and P,,, in that 


|(u(t)w*(z))? = PLP, [3] 


You can easily confirm that the equality holds when v(t) = aw(t), with a being an 
arbitrary constant. Hence, |(u(t)w*(t))| is maximum when the signals are propor- 
tional. We'll soon define correlation in terms of the scalar product. 

First, however, let's further interpret (u(t)w*(t)) and prove Schwarz's inequal- 
ity by considering 


z(t) = v(t) — aw(t) ^ dd 


| 
| 
| 
| 
| 
| 
| 
i 
| 
| 
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The average power of z(f) is 
P, = (et)z*(t)) = ([v(t) — aw()]o*(t) — a*w*()]) [4b] 
= (v(t)*(t)) + aa*(w(t)w*(t)) — a*(v(2)w*()) — a(o lew) 
= P, + aa*P,, — 2 Re [a*(v(t)w*(t))] 


where Eqs. (2a) and (2c) have been used to expand and combine terms. If a — 1, 
then z(t) = v(t) — w(t) and - 


P, =P, + P, - 2 Re (u(t)w*(t)) 


À large value of the scalar product thus implies similar signals, in the sense that the 
difference signal v(t) — w(t) has small average power. Conversely, a small scalar 
product implies dissimilar signals and P, ~ P, + P 

To prove Schwarz’s inequality from Eq. (4b), let a = (v(t)w*(t))/P,, so 

aa*P,, = a*(u(t)w*(t)) = (vw) P/P, 

Then P, = P, — |(v(t)w*(t)) P/P,, = 0, which reduces to Eq. (3) and completes the 
preliminary work. 

Now we define the crosscorrelation of two power signals ast 


Ryy(t) = (v(r)w*(t — 7)) = (v(t + 7)w*(t)) [5] 
This is a scalar product with the second signal delayed by 7 relative to the first or, 
equivalently, the first signal advanced by 7 relative to the second. The relative dis- 


placement 7 is the independent variable in Eq. (5), the variable t having been 
washed out in the time average. General properties of R,,,,(7) are 


IRT)? = P,P, [6a] 

Ro) = REET) [65] 

Equation (6a) simply restates Schwarz’s inequality, while Eq. (6b) points out that 
Ry(7) * Ry). 

We conclude from our previous observations that R,, (7) measures the similarity 

between u(t) and w(t — 7) as a function of r. Crosscorrelation is thus a more sophis- 


ticated measure than the ordinary scalar product since it detects time-shifted simi- 
larities or differences that would be ignored in (u(t)w*(t)). 


But suppose we correlate a signal with itself, generating the autocorrelation 
function 


Rc) Ê Rav) = (v(v*(t — 7) = (volt + c)u*(0) uj 


This autocorrelation tells us something about the time variation of u(t), at least in an 
averaged sense. If |R,(7)| is large, we infer that u(t — 7) is very similar to v(t) for 


TAnother definition used by some authors is (v*(t)w(t + 7)), equivalent to interchanging the sub- 
scripts on R,, (7) in Eq. (5). 
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that particular value of 7; whereas if |R,(7)| is small, then v(7) and v(t — 7) must 
look quite different. 
Properties of the autocorrelation function include 


R,(0) = P, [8a] 
IR.(7)| = R,(0) [8b] 
Ry(-7) = Ri) [8d] 


Hence, R,(7) has hermitian symmetry and a maximum value at the origin equal to 
the signal power. If v(7) is real, then R,(7) will be real and even. If v(t) happens to 
be periodic, R,(7) will have the same periodicity. 

Lastly, consider the sum or difference signal 


z(t) = v(t) X w(t) [9a] 
Upon forming its autocorrelation, we find that 
RAT) = Rr) RS IR RU [9b] 


If v(t) and w(t) are uncorrelated for all 7, so 
Ro) = Ry(7) = 0 
then Rí(r) = R,(r) + R,(7) and setting 7 = 0 yields 
P= Prt Pe 


Superposition of average power therefore holds for uncorrelated signals. 


Correlation of Phasors and Sinusoids 


The calculation of correlation functions for phasors and sinusoidal signals is expe- 
dited by calling upon Eq. (18), Sect. 2.1, written as 


jat — jot : 1 d j (ww) t 
(elie 7») = lim — gm dr [10] 
pue -1/2 
ouo (1 9)T p Q5 F 0 
= lim sine -——— —— = 
Tou T 1 Wa = €, 


We'll apply this result to the phasor signals 
v(t) = C, e*t w(t) = C, el"! [11a] 


where C, and C, are complex constants incorporating the amplitude and phase 
angle. The crosscorrelation is 


Rer) = [cue] Cete) 


Exe C, C£ el^"? (gite ud 
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=e s My F Dy 1116] 
Q,Ckelve Wy = Wy 


Hence, the phasors are uncorrelated unless they have identical frequencies. The 
autocorrelation function is 


R XT) He | C, [em 


nid 
which drops out of Eq. (11b) when w(t) = v(t). 
Now it becomes a simple task to show that the sinusoidal signal 
z(t) = A cos (wot + Q) [12a] 
has 
A? 
Rift) = 75 €08 wor [125] 


Clearly, R,(T) is real, even, and periodic, and has the maximum value 
R{0) = A?/2 = P, This maximum also occurs whenever «7 equals a multiple of 
24r radians, so z(t + T) = z(t). On the other hand, R(T) = 0 when z(t + 7) and 
z(t) are in phase quadrature. 

But notice that the phase angle $ does not appear in R (7T), owing to the aver- 


aging effect of correlation. This emphasizes the fact that the autocorrelation func- 
tion does not uniquely define a signal. 





and (11). 





Correlation of Energy Signals 


Averaging products of energy signals over all time yields zero. But we can mean- 
ingfully speak of the total energy 


E, £ | v(t)u*(t) dt = 0 (13] 
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Similarly, the correlation functions for energy signals can be defined as 


Ry(T) £ | v(t)w*(t — 7) dt [14a] 
R(t) = R,,(7) [14b] 


Since the integration operation f Lu EO dt has the same mathematical properties as 
the time-average operation (z(t)), all of our previous correlation relations hold for 


Derive Eq. (12b) by writing z(£) as a sum of conjugate phasors and applying Eqs. (9) EXERCISE 3.6-1 
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the case of energy signals if we replace average power P, with total energy E,. Thus, 
for instance, we have the property 


IR (7) = EE, [15] 
as the energy-signal version of Eq. (6a). 
Closer examination of Eq. (14) reveals that energy-signal correlation is a type 


of convolution. For with z(t) = w*(—t) and t = A, the right-hand side of Eq. (14a) 
becomes 


oo 
| v(A)z(r — A) dA = U(r) * z(7) 
and therefore 
Ry(r) = v(7) * w*(—7) [16] 
Likewise, R(T) = v(t) * v*(—7). 
Some additional relations are obtained in terms of the Fourier transforms 
Vif) = F[v(t)], etc. Specifically, from Eqs. (16) and (17), Sect. 2.2, 


R0) =E= | OPA 


eo 


R40) = | vest) ar T vana 


—oo - 


Combining these integrals with |R,,,(0)|? « E,E, = R,(0)R, (0) yields 








| vowel <] wore] mora un 


-0 - 


Equation (17) is a frequency-domain statement of Schwarz’s inequality. The equal- 
ity holds when V(f) and W(f) are proportional. 


Pattern Recognition 


Crosscorrelation can be used in pattern recognition tasks. If the crosscorrelation of 
objects A and B is similar to the autocorrelation of A, then B is assumed to match A. 
Otherwise B does not match A. For example, the autocorrelation of x(t) = lI(f) can be 
found from performing the graphical correlation in Eq. (145) as R(T) = A(7). If we 
examine the similarity of y(t) = 211(t) to x(¢) by finding the crosscorrelation R(7) = 
2 (T), we see that R,,(7) is just a scaled version of R,(r). Therefore y(t) matches x(t). 
However, if we take the crosscorrelation of z(t) = u(t) with x(t), we obtain 


1 forr < —1/2 
R,{7) = 41/2 -— 7 for —1/2 = 7 £S 1/2 
0 for r > 1/2 


and conclude that z(t) doesn't match x(t). 


e mme 


3.6 Correlation and Spectral Density 


This type of graphical correlation is particularly effective for signals that do not 
have a closed-form solution. For example, autocorrelation can find the pitch (funda- 
mental frequency) of speech signals. The crosscorrelation can determine if two speech 
samples have the same pitch, and thus may have come from the same individual. 





Let u(t) = A[u(t) — u(t — D)] and w(t) = u(t — ty). Use Eq. (16) with z(7) = 
to sketch R,,(7). Confirm from your sketch that |R,,(r)|? = E,E, and that 
|R sult) ona = E,E, atT — aly. 





We next investigate system analysis in the "7 domain," as represented by 
Fig. 3.6-1. A signal x(f) having known autocorrelation R,(7) is applied to an LTI 
system with impulse response A(t), producing the output signal 


y(t) = A(t) * x(t) = i h(A) x(t — A) da 


— oO 


We'll show that the input-output crosscorrelation function is 


Ry(7) = h(t) * RÈT) = | h(A) R,(r — A) dA [18] 
and that the output autocorrelation function is 
R(T) = h*(—T) * Ry(7) z | h*(— p )Ry. E L) du [9a] 


Substituting Eq. (18) into (19a) then gives 


Ry(7) = h*(—T) * h(t)» RT) [19b] 
Note that these r-domain relations are convolutions, similar to the time-domain 
relation. 

For derivation purposes, let's assume that x(f) and y(t) are power signals so we 
can use the compact time-averaged notation. Obviously, the same results will hold 
when x(t) and y(t) are both energy signals. The assumption of a stable system 
ensures that y(t) will be the same type of signal as x(f). 

Starting with the crosscorrelation R(T) = (y(t)x*(t — 7)), we insert the 
superposition integral h(f)x*(t) for y(t) and interchange the order of operations to get 


R,,(T) = | h(A)(x(t — A)x*(t — 7)) dA 


— c0 


x(t) y(t) 
————— h(t) 
Rr) Ry) 


Figure 3.6-1 
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But since (z(t)) = (z(t + A)) for any A, 
(x(t — A) 7)) = (x(t +A — Alt A-T) 
= (x(t)x*[t — (7 — A)]) 
= R(T- À) 


Hence, 


R(T) = iG h(A)R,(7 — A) dA 


Proceeding in the same fashion for R (r) = (y(t)y*(t — 7)) we arrive at 


Ry) = | h*(A)(y(t)x*(t — T — A))dA 
in which (y(t)x*(¢ — 7 — A)) = Ry(r + A). Equation (19a) follows from the 
change of variable u = — A. 


Spectral Density Functions 


At last we're prepared to discuss spectral density functions. Given a power or energy 
signal v(t), its spectral density function G,( f ) represents the distribution of power 
or energy in the frequency domain and has two essential properties. First, the area 
under G,( f ) equals the average power or total energy, so 


| G,(f) df = R,(0) [20] 
Second, if x(t) is the input to an LTI system with H(f) = F[A(t)], then the input 
and output spectral density functions are related by 


Gf) = HODPG) [21] 


since |H(f)|? is the power or energy gain at any f. These two properties are com- 
bined in 
eo 


Ry(0) = | BC) Gf) df [22] 


which expresses the output power or energy R,(0) in terms of the input spectral 
density. 

Equation (22) leads to a physical interpretation of spectral density with the help 
of Fig. 3.6-2. Here, G,(f) is arbitrary and | H( f) acts like a narrowband filter with 
unit gain, so 

Af Af 


Gs BE SI 


0 otherwise 
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GCF) 


0 fe 
IES) 








Figure 3.6-2 Interpretation of spectral density functions. 


If Af is sufficiently small, the area under G,(f) will be R,(0) = G,(f,) Af and 
GF.) = Ry(0)/Af 


We conclude that at any frequency f = f., G.( f.) equals the signal power or energy 
per unit frequency. We further conclude that any spectral density function must be 
real and nonnegative for all values of f. 

But how do you determine G,(f) from v(f)? The Wiener-Kinchine theorem 


states that you first calculate the autocorrelation function and then take its Fourier 
transform. Thus, 


Gf) = F,[R,(7)] | Rr jeer dt [23a] 


— cO 


where 9, stands for the Fourier transform operation with 7 in place of t. The inverse 
transform is 


R(t) = 97[G,/)] 4 | C Gf yel af (238 


— oo 


so we have the Fourier transform pair 


R(t) e Gf) 


All of our prior transform theorems therefore may be invoked to develop relation- 
ships between autocorrelation and spectral density. 
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If v(t) is an energy signal with Vif) = F[v(t)], application of Eqs. (16) and 
(23a) shows that 


Gf) = IW? [24] 


and we have the energy spectral density. If v(/) is a periodic power signal with the 
Fourier series expansion 


oo 


v(t) = Y c(nf)e? "^! [25a] 


n--—oo 


the Wiener-Kinchine theorem gives the power spectral density, or power spec- 
trum, as 


Gf) ^ let) POF — nf) [2541 


This power spectrum consists of impulses representing the average phasor power 
|c(nfo)|? concentrated at each harmonic frequency f = nf}. Substituting Eq. (25b) 
into Eq. (20) then yields a restatement of Parseval's power theorem. In the special 
case of a sinusoidal signal 


z(t) = A cos (wot + o) 
we use R (7) from Eq. (12b) to get 
Gf) = 9,[(47/2) cos 2m for] 


A? A? 
= a=) + "4 OF + fo) 


which is plotted in Fig. 3.6—3. 

All of the foregoing cases lend support to the Wiener-Kinchine theorem but do 
not constitute a general proof. To prove the theorem, we must confirm that taking 
Gf) = 9,(R,(7)] satisfies the properties in Eqs. (20) and (21). The former 
immediately follows from the inverse transform in Eq. (23b) with 7 = 0. Now recall 
the output autocorrelation expression 


Ry(r) = h*(—r) * A(z) * R(T) 


Gf) 


A2/4 A?/4 


-f 0 fo 


Figure 3.6-3 Power spectrum of zl) = A cos (mot 9). 
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Since 
S,[hr))- HU) — &n*(-7)] = B) 
the convolution theorem yields 
S [R0] = HA AHA)F[RA7)] 
and thus Gf) = |H(f)|°G.(f) if we take 9,[R (7)] = G,(f), etc. 





The signal x(t) = sinc 10t is input to the system in Fig. 3.6-1 having the transfer 
function 
= f —j4nf 
H(f) = 30 4 J* d 
We can find the energy spectral density of x(t) from Eq. (24) 


G0) = IP = xs n( 4) 


and the corresponding spectral density of the output y(?) 


Gf) = IG FG) 
= [S laos G)] 
"e" (s) 


since the amplitudes multiply only in the region where the functions overlap. There 
are several ways to find the total energies E, and Ej. We know that 


EXAMPLE 3.6-3 


—oco = 


5 
1 1 
— ——d = — 
L 100 4 ~ 10 


E-| OPa = | mora [e af 


Or we can find R(T) = #7'{G,(f )} = $ sine 10t from which E, = R,(0) = $. 
Similarly, 


g= | bores | moras lee 


— oo = 


2 
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= df= 
| md 


9 





w 
n 


134 CHAPTER3 € Signal Transmission and Filtering 


And correspondingly R,(r) = 9;!(G(f)) = x sinc 4t which leads to the same 
result that E, = R,(0) = *. We can find the output signal y(t) directly from the 
relationship 


3 , 
H = xa) = 2 n(£ ere 
10 4 
by doing the same type of multiplication between rectangular functions as we did 
earlier for the spectral density. Using the Fourier transform theorems, 
y(t) = É sinc 4(r — 2). 





EXAMPLE 3.6-4 Comb Filter - 
Consider the comb filter in Fig. 3.6—4a. The impulse response is 

h(t) = e(t) — lt — T) 
so 

O i 
and 
A(f) |? = 2 — eT — eat 
= 4 sin? 2a(f/f) | f, =2/T 


The sketch of | H(f )|* in Fig. 3.6-4b explains the name of this filter. 
If we know the input spectral density, the output density and autocorrelation can 
be found from 


Gf) = 4 sin’ 2«(f/f.) Gf) 


HCY 


4 
y(t) 





-flá 0 £4 f2 3f/4 f 
a) (b) 


Figure 3.6-4 Comb filter. 
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R(t) = 9 [G(f)] 


If we also know the input autocorrelation, we can write 


Ry) = FAP) P] * R C£) 


where, using the exponential expression for | H( f), 


Se [[BCP) P] = 28(7) — 8(r — T) - 8(r + T) 


Therefore, 


R,(7) = 2R,(r) = R(t = T) Es RAT + T) 


and the output power or energy is R,(0) = 2R,(0) — R(-T) — R(T). 





Gf) = 


3.1-1 


3.1-2 
3.1-3* 
3.1-4 
3.1-5 
3.1-6 


3.1-7 
3.1-8 
3.1-9* 


3.1-10 
3.1-11 
3.1-12 


3.1-13 


Let v( be an energy signal Show that ¥,[v*(—7)] = V*(f). Then derive 
IV £) by applying Eq. (23a) to Eq. (16). 


EXERCISE 3.6-3 


3.7 PROBLEMS 


A given system has impulse response h(t) and transfer function H(f). Obtain 
expressions for y(t) and Y(f) when x(t) = A[6(t + tz) — 8(t — t;)]. 


Do Prob. 3.1-1 with x(t) = A[6(t + ta) + ó(t)] 

Do Prob. 3.1-1 with x(t) = AA(t — t). 

Do Prob. 3.1-1 with x(t) = Au(t — t). 

Justify Eq. (7b) from Eq. (14) with x(t) = u(t). 

Find and sketch |H(f)| and arg H(f) for a system described by the differential 
equation dy(t)/dt + 4ary(t) = dx(t)/dt + 16rx(t). 

Do Prob. 3.1-6 with dy(t)/dt + 16my(t) = dx(t)/dt + 47rx(t). 

Do Prob. 3.1-6 with dy(t)/dt — 4ary(t) = — dx(t)/dt + 4wx(t). 


Use frequency-domain analysis to obtain an approximate expression for y(t) when 
H(f) = B/(B + jf) and x(t) is such that X(f) ~ 0 for |f| < W with W > B. 
Use frequency-domain analysis to obtain an approximate expression for y(f) when 
H(f) = jf/(B + jf) and x(f) is such that X(f) =~ 0 for |f| > W with W < B. 
The input to an RC lowpass filter is x(t) = 2 sinc 4Wt. Plot the energy ratio Ej/E, 
versus B/W. 


Sketch and label the impulse response of the cascade system in Fig. 3.1-8b5 when 
the blocks represent zero-order holds with time delays T, > T». 


Sketch and label the impulse response of the cascade system in Fig. 3.1-85 when 


Hi(f) = {1+ j(f/B)]^! and the second block represents a zero-order hold with 
time delay T >> 1/B. 
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3.1-14* 


3.1-15 


3.1-16t 


3.2-1 


3.2-2 


3.2-3* 


3.2-4 


3.2-5 


3.2-6 


3.2-7 


3.2-8* 
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Find the step and impulse response of the feedback system in Fig. 3.1-8c when 
H,(f) is a differentiator and H;( f) is a gain K. 


Find the step and impulse response of the feedback system in Fig. 3.1-8c when 
H,(f) is a gain K and H»( f) is a differentiator. 


If H(f) is the transfer function of a physically realizable system, then h(t) must be 
real and causal. As a consequence, for t = 0 show that 


A(t) = «| HLF) cos wt df = af HUN) cos wt df 
0 0 


where H(f) = Re(H(f)] and Hf) = Im[H(f)]. 


Show that a first-order lowpass system yields essentially distortionless transmission 
if x(t) is bandlimited to W < B. 


Find and sketch y(t) when the test signal x(t) = 4 cos wot + § cos 3wot + 
$ cos 5w, which approximates a triangular wave, is applied to a first-order low- 
pass system with B = 3f. 


Find and sketch y(t) when the test signal from Prob. 3.2-2 is applied to a first-order 
highpass system with H(f) = jf/(B + jf) and B = 3f. 
The signal 2 sinc 40t is to be transmitted over a channel with transfer function H( f). 


The output is y(t) = 20 sinc (40t — 200). Find H(f) and sketch its magnitude and 
phase over | f| 30. 


Evaluate t,(f) at f = 0, 0.5, 1, and 2 kHz for a first-order lowpass system with 
B = 2kHz. 


A channel has the transfer function 


£) —jnf/30 < 
an(£ e for| f| = 15 Hz 


H(f) = 
an( e for| f| > 15 Hz 


Sketch the phase delay ¢,(f) and group delay t,(f). For what values of f does 
tf) = tf)? 

Consider a transmission channel with H,(f) = (1 + 2a cos co T)e 7*7, which has 
amplitude ripples. (a) Show that y(t) = ax(t) + x(t — T) + ax(t — 2T), so the 
output includes a leading and trailing echo. (b) Let x(t) = II(t/) and a = 1/2. 
Sketch y(t) for 7 = 27/3 and 47/3. 

Consider a transmission channel with Hc(f) = exp[—j(wT — a sin o T)], which has 


phase ripples. Assume |a| << 7/2 and use a series expansion to show that the out- 
put includes a leading and trailing echo. 





3.2-9 
3.2-10 
3.2-11 


3.2-12 
3.3-1* 


3.3-2 
3.3-3 


3.3-4 


3.3-5 
3.3-6* 


3.3-7 
3.3-8 


3.3-9 


3.4-] 


3.4-2* 


3.4-3 
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Design a tapped-delay line equalizer for H,( f) in Prob. 3.2-8 with a = 0.4. 
Design a tapped-delay line equalizer for H,(f) in Prob. 3.2-7 with a = 0.4. 
Suppose x(t) = A cos «f is applied to a nonlinear system with y(t) = 2x(f) — 3x*(t). 


Write y(f) as a sum of cosines. Then evaluate the second-harmonic and third- 
harmonic distortion when A = 1 and A = 2. 


Do Prob. 3.2-11 with y(t) = 5x(t) — 2x?(t) + 4x'(t). 
Let the repeater system in Fig. 3.3-2 have P,, = 0.5 W, a = 2 dB/km, and a total 


path length of 50 km. Find the amplifier gains and the location of the repeater so that 
Pou = 50 mW and the signal power at the input to each amplifier equals 20uW. 


Do Prob. 3.3-1 with P,, = 100 mW and P,,, = 0.1 W. 


- A 400 km repeater system consists of m identical cable sections with a = 0.4 dB/km 


and m identical amplifiers with 30. dB maximum gain. Find the required number of 
sections and the gain per amplifier so that Pou = 50 mW when Pj, = 2W. 


A. 3000 km repeater system consists of m identical fiber-optic cable sections with 
a = 0.5 dB/km and m identical amplifiers. Find the required number of sections and 
the gain per amplifier so that Pout = Pin = 5 mW and the input power to each ampli- 
fier is at least 67 uW. 


Do Prob. 3.3-4 with œ = 2.5 dB/km. 


Suppose the radio link in Fig. 3.3—4 has f = 3 GHz, £ = 40 km, and P, = SW. If 
both antennas are circular dishes with the same radius r, find the value of r that 
yields Py, = 2 uW. 

Do Prob. 3.3-6 with f = 200 MHz and ¢ = 10 km. 


The radio link in Fig. 3.34 is used to transmit a metropolitan TV signal to a rural 
cable company 50 km away. Suppose a radio repeater with a total gain of gyt 
(including antennas and amplifier) is inserted in the middle of the path. Obtain the 
condition on the value of g,,, so that Pou is increased by 20 percent. 


A direct broadcast satellite (DBS) system uses 17 GHz for the uplink and 12 GHz 
for the downlink. Using the values of the amplifiers from Example 3.3-1, find Pou 
assuming Pj, = 30 dBW. 


Find and sketch the impulse response of the ideal HPF defined by Eq. (1) with 
f, co. 


Find and sketch the impulse response of an ideal band-rejection filter having 


H(f) = 0 for f, — B/2 < |f| € f, + B/2 and distortionless transmission for all 
other frequencies. 


Find the minimum value of n such that a Butterworth filter has |H(f)| = —1 dB for 
|f| < 0.7B. Then calculate |H(3B)| in dB. 


138 


3.4-4 


3.4-5 


3.4-6 


3.4-7 
3.4-8* 


3.4-9 


3.4-10t 


3.4-11f 


3.5-1 


3.5-2* 


3.5-3 
3.5-4 
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Find the minimum value of n such that a Butterworth filter has |H(f)| = —1 dB for 
|f| < 0.9B. Then calculate |H(3B)| in dB. 

The impulse response of a second-order Butterworth LPF is A(t) = 
2be "sin bt u(t) with b = 27B/~/2. Derive this result using a table of Laplace 
transforms by taking p = s/27rB in Table 3.4-1. 

Let R = VIJC in Fig. 3.4-7. (a) Show that JH(f)? = [1 — (fh)? + (A) 
with fy = 1/2rm VLC. (b) Find the 3 dB bandwidth in terms of fọ. Then sketch 
|H(f)| and compare with a second-order Butterworth response. 


Show that the 10-90% risetime of a first-order LPF equals 1/2.87B. 
Use A(t) given in Prob. 3.4—5 to find the step response of a second-order Butterworth 
LPF. Then plot g(r) and estimate the risetime in terms of B. 


Let x(t) = A sinc 4Wt be applied to an ideal LPF with bandwidth B. Taking the dura- 
tion of sinc at to be T = 2/a, plot the ratio of output to input pulse duration as a 
function of B/W. 

The effective bandwidth of an LPF and the effective duration of its impulse 
response are defined by 


| now NOE: 
Bes 7 38) T Th) Inn 


Obtain expressions for H(0) and |A(t)| from F[A(t)] and 9 ![H(f)], respectively. 
Then show that 7, = 1/2 Beg. 


Let the impulse response of an ideal LPF be truncated to obtain the causal function 


and A(t) = 0 elsewhere. (a) Show by Fourier transformation that 
K . 
H(f) = —e {Si [nf + B)ta} — Si [2a(f - B)tal} 


(b) Sketch A(t) and |H(f)| for t, >> 1/B and t, = 1/2B. 

Let x(t) = ó(t). (a) Find x(t) from Eq. (2) and use your result to confirm that 
¥-| —j sgnf | = 1/«t. (b) Then derive another Hilbert transform pair from the 
property X(t)*(—1/at) = x(t). 

Use Eq. (3), Sect. 3.1, and the results in Example 3.5—2 to obtain the Hilbert trans- 
form of AII(r/7). Now show that if v(t) = A for all time, then d(¢) = 0. 

Use Eq. (3) to show that if x(t) = sinc 2Wt then x(t) = «Wt sinc? Wt. 

Find the Hilbert transform of the signal in Fig. 3.2-3 using the results of Example 
3.5-1. 


numee e e a MÀ re r À 


rmm niu ab a laria antad aea atta pe dedit 


ETE NU UNE TERCER 





3.5-5* 


3.5-6 
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3.6-1 
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3.6-5* 
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Find the Hilbert transform of the signal 
x(t) = 4 cos wot + $ cos 3wot + 55 cos Swf. 


Show that the functions that form the Hilbert transform pair in Prob. 3.5-3 have the 
same amplitude spectrum by finding the magnitude of the Fourier transform of each. 
(Hint: Express the sinc? term as the product of a sine function and sinc function.) 
Show that pes x(t)x(t)dt = 0 for x(t) = A cos wot. 

Let the transfer function of a filter be written in the form H(f) = Hf) + jH,(f), 
as in Eq. (10), Sect. 2.2. If the filter is physically realizable, then its impulse 
response must have the causal property h(t) = 0 for t < 0. Hence, we can write 
h(t) = (1 + sgnt)h,(t) where h,(t)  ih(|r) for —oo < t < oo. Show that 
F[h.(t)] = Hf) and thus causality requires that H,(f) = —H,(f). 

Prove Eq. (6b). 

Let v(t) be periodic with period Ty. Show from Eq. (7) that R,(7) has the same peri- 
odicity. 

Derive Eq. (8b) by taking w(t) = v(t — 7) in Eq. (3). 

Use the method of pattern recognition demonstrated in Example 3.6—2 to determine 
whether y(t) = sin 2wof is similar to x(t) = cos 2wof. 

Use Eq. (24) to obtain the spectral density, autocorrelation, and signal energy when 
u(t) = AIl[(t — t4)/D]. 

Do Prob. 3.6-5 with v(t) = A sinc AW(t + t4). 

Do Prob. 3.6-5 with v(t) = Ae u(t). 

Use Eq. (25) to obtain the spectral density, autocorrelation, and signal power when 
v(t) = Ag + A, sin (wot + 9). 

Do Prob. 3.6-8 with v(t) = A, cos (wot + $4) + Asin (2wot + $). 

Obtain the autocorrelation of v(t) = Au(t) from Eq. (7). Use your result to find the 
signal power and spectral density. 


The energy signal x(t) = I1(10f) is input to an ideal lowpass filter system with 
K = 3, B = 20, and t, = 0.05, producing the output signal y(£). Write and simplify 
an expression for R (7). 
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he several purposes of modulation were itemized in Chap. 1 along with a qualitative description of the process. 

To briefly recapitulate: modulation is the systematic alteration of one waveform, called the carrier, according to 
the characteristics of another waveform, the modulating signal or message. The fundamental goal is to produce an 
information-bearing modulated wave whose properties are best suited to the given communication task. 

We now embark on a tour of continuous-wave (CW) modulation systems. The carrier in these sys- 
tems is a sinusoidal wave modulated by an analog signal—AM and FM radio being familiar examples. The abbre- 
viation CW also refers to on-off keying of a sinusoid, as in radio telegraphy, but that process is more accurately 
termed interrupted continuous wave |ICVV). 

This chapter deals specifically with linear CW modulation, which involves direct frequency translation of the 
message spectrum. Double-sideband modulation [DSB] is precisely that. Minor modifications of the translated spec- 
trum yield conventional amplitude modulation (AM|], single-sideband modulation (SSB), or vestigial-sideband modu- 
lation [VSB]. Each of these variations has its own distinct advantages and significant practical applications. Each will 
be given due consideration, including such matters as waveforms and spectra, modulation methods, transmitters, and 


demodulation. The chapter begins with a general discussion of bandpass signals and systems, pertinent fo all forms 
of CW modulation. l 


OBJECTIVES 





After studying this chapter and working the exercises, you should be able to do each of the following: 

1. Given a bandpass signal, find its envelope and phase, in-phase and quadrature components, and lowpass equiva- 
lent signal and spectrum (Sect. 4.1). 

2. State and apply the fractional-bandwidth rule of thumb for bandpass systems (Sect. 4.1). 


Sketch the waveform and envelope of an AM or DSB signal, and identify the spectral properties of AM, DSB, 
SSB, and VSB (Sects. 4.2 and 4.4). 


4, Construct the line spectrum and phasor diagram, and find the sideband power and total power of an AM, DSB, 
SSB or VSB signal with tone modulation (Sects. 4.2 and 4.4). 


5. Distinguish between product, power-law, and balanced modulators, and analyze a modulation system (Sect. 4.3). 
6. Identify the characteristics of synchronous, homodyne, and envelope detection (Sect. 4.5). 





4.1 BANDPASS SIGNALS AND SYSTEMS 


Effective communication over appreciable distance usually requires a high-frequency 
sinusoidal carrier. Consequently, by applying the frequency translation (or modula- 
tion) property of the Fourier transform from Sect. 2.3 to a bandlimited message sig- 
nal, we can see that most long-haul transmission systems have a bandpass fre- 
quency response. The properties are similar to those of a bandpass filter, and any 
signal transmitted on such a system must have a bandpass spectrum. Our purpose 
here is to present the characteristics and methods of analysis unique to bandpass 
systems and signals. Before plunging into the details, let's establish some conven- 
tions regarding the message and modulated signals. 





4.1 Bandpass Signals and Systems 


Analog Message Conventions 


Whenever possible, our study of analog communication will be couched in terms of 
an arbitrary message waveform x(t)—which might stand for a sample function from 
the ensemble of possible messages produced by an information source. The one 
essential condition imposed on x(f) is that it must have a reasonably well-defined 
message bandwidth W, so there's negligible spectral content for | f | > W. Accord- 
ingly, Fig. 4.1-1 represents a typical message spectrum X(f) = F[x(t)] assuming 
the message is an energy signal. 

For mathematical convenience, we'll also scale or normalize all messages to 
have a magnitude not exceeding unity, so 


|x(t)| s 1 [1] 
This normalization puts an upper limit on the average message power, namely 
S, = (xt) x1 [2] 


when we assume x(t) is a deterministic power signal. Both energy-signal and power- 
signal models will be used for x(t), depending on which one best suits the circum- 
stances at hand. l 

Occasionally, analysis with arbitrary x(t) turns out to be difficult if not impossi- 


ble. As a fall-back position we may then resort to the specific case of sinusoidal or 
tone modulation, taking 


X(t) = Am COS Opt Am = 1 fa< W [3] 


Tone modulation allows us to work with one-sided line spectra and simplifies power 
calculations. Moreover, if you can find the response of the modulation system at a 
particular frequency fn you can infer the response for all frequencies in the message 
band—barring any nonlinearities. To reveal potential nonlinear effects, you must 
use multitone modulation such as 


x(t) = A, cos wit + A, cos wt + cc 


with A, + A; + ++: x 1 to satisfy Eq. (1). 


XC 





Figure 4.1-1 Message spectrum with bandwidth W. 
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Bandpass Signals 


We next explore the characteristics unique to bandpass signals and establish some 
useful analysis tools that will aid our discussions of bandpass transmission. Con- 
sider a real energy signal v, (7) whose spectrum V,,{ £) has the bandpass character- 
istic sketched in Fig. 4.1—2a. This spectrum exhibits hermitian symmetry, because 
Upp(t) is real, but V,(f£) is not necessarily symmetrical with respect to =f. We 
define a bandpass signal by the frequency domain property 


f| T £TW 


which simply states that the signal has no spectral content outside a band of width 
2W centered at f.. The values of f, and W may be somewhat arbitrary, as long as they 
satisfy Eq. (4) with W < f. i 

The corresponding bandpass waveform in Fig. 4.1-2b looks like a sinusoid at 
frequency f, with slowly changing amplitude and phase angle. Formally we write 


vy (t) = A(t) cos [e.t + P(t)] [5] 


where A(f) is the envelope and dt) is the phase, both functions of time. The enve- 
lope, shown as a dashed line, is defined as nonnegative, so that A(t) = 0. Negative 
“amplitudes,” when they occur, are absorbed in the phase by adding +180°. 


Viel f ) 


IV, C£) 





(b) 


Figure 4.1-2 Bandpass signal. [a] Spectrum; (b) waveform. 
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Figure 4.1-3a depicts u,,(t) as a complex-plane vector whose length equals A(t) 
and whose angle equals ct + $(t). But the angular term w,f represents a steady 
counterclockwise rotation at f, revolutions per second and can just as well be sup- 
pressed, leading to Fig. 4.1-3b. This phasor representation, used regularly hereafter, 
relates to Fig. 4.1—3a in the following manner: If you pin the origin of Fig. 4.1-3b 
and rotate the entire figure counterclockwise at the rate f., it becomes Fig. 4.1—3a. 

Further inspection of Fig. 4.1-3a suggests another way of writing v; (t). If we let 


v(t) = A(t) cos &(?) v(t) $ A(z) sin olt) [6] 
then 
Uf) = vt) cos ot — v(t) sin «t [7] 


= vuj(t) cos w,t + u,{t) cos (et + 90?) 


Equation (7) is called the quadrature-carrier description of a bandpass signal, as 
distinguished from the envelope-and-phase description in Eq. (5). The functions 
v;(t) and v,(t) are named the in-phase and quadrature components, respectively. 
The quadrature-carrier designation comes about from the fact that the two terms in 
Eq. (7) may be represented by phasors with the second at an angle of -- 90? com- 
pared to the first. 

While both descriptions of a bandpass signal are useful, the quadrature-carrier 
version has advantages for the frequency-domain interpretation. Specifically, 
Fourier transformation of Eq. (7) yields 


WU) = SU = S) + v pi YF 1) m 





where 
Vif) = Flodt)] — V) = FLv,(t)] 
Up) 
wet + b(t) 
ui) 
(a) (bj 
Figure 4,1-3 (a) Rotating phasor; (b) phasor diagram with rotation suppressed. 
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To obtain Eq. (8) we have used the modulation theorem from Eq. (7), Sect. 2.3, 
along with e=/°” = +j. The envelope-and-phase description does not readily con- 
vert to the frequency domain since, from Eq. (6) or Fig. 4.1-30, 


A(t) = VvXt) + vyt) p(t) = arctan z [9] 





which are not Fourier-transformable expressions. 
An immediate implication of Eq. (8) is that, in order to satisfy the bandpass con- 
dition in Eq. (4), the in-phase and quadrature functions must be lowpass signals with 


Vf) VAL) Oe pee 


In other words, 





We'll capitalize upon this property in the definition of the lowpass equivalent 
spectrum 


Volf) = aV) * JVG)] [10a] 


= V + SAUE + fo) [10b] 


As shown in Fig. 4.1-4, V,,(f) simply equals the positive-frequency portion of 
Vj (f ) translated down to the origin. 
Going from Eq. (10) to the time domain, we obtain the lowpass equivalent signal 


vet) = 9 [Ve (£)] = 30v) + j(0] [tal 


Thus, ve,(t) is a fictitious complex signal whose real part equals 4y,(t) and whose 
imaginary part equals 4v q(t). Alternatively, rectangular-to-polar conversion yields 


Vel f) 





Figure 4.1-4 Lowpass equivalent spectrum. 


—— 


4.1 Bandpass Signals and Systems 


ver) = LAC) ur 


where we've drawn on Eq. (9) to write Vep(t) in terms of the envelope and phase 
functions. The complex nature of the lowpass equivalent signal can be traced back to 
its spectrum V,,(f), which lacks the hermitian symmetry required for the transform 
of a real time function. Nonetheless, v¢,(t) does represent a real bandpass signal. 

The connection between v,,(t) and v,,(t) is derived from Eqs. (5) and (110) as 
follows: 


Valt) = Re {A(t)e iiet ea} [12] 
2 Re [1A(r)e te 189] 
= 2 Re [vs (t)e ^] 


This result expresses the lowpass-to-bandpass transformation in the time domain. 
The corresponding frequency-domain transformation is 
Valf) = Vel f ae fa T V% (-f a fa [13a] 


whose first term constitutes the positive-frequency portion of V,,(f) while the sec- 
ond term constitutes the negative-frequency portion. Since we’ll deal only with real 
bandpass signals, we can keep the hermitian symmetry of V,,(f) in mind and use 
the simpler expression 


Vau r= Vera a [135] 
which follows from Figs. 4.1—2a and 4.1-4. 





Letz(t) = ve, (t)e/" and use 2 Re [z(2)] = z(/) + z*(£) to derive Eq. (13a) from Eq. (12). 


Bandpass Transmission 

Now we have the tools needed to analyze bandpass transmission represented by 
Fig. 4.1—5a where a bandpass signal x,,(t) applied to a bandpass system with trans- 
fer function H,(f) produces the bandpass output y,,(t). Obviously, you could 
attempt direct bandpass analysis via Y, (f) = Hj (f )X,, (f). But it’s usually easier 
to work with the lowpass equivalent spectra related by 


Yet) = Hol f) Xf) [14a] 
where 
Heol f) = Half + f)u(f + f) [145] 


which is the lowpass equivalent transfer function. 
Equation (14) permits us to replace a bandpass system with the lowpass equiva- 
lent model in Fig. 4.1-5b. Besides simplifying analysis, the lowpass model provides 


EXERCISE 4.1-1 
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Xpp(t) —À aun pcm Yuan aD Hof) [eo 


Da 


la] (b) 





Figure 4.1-5 (a) Bandpass system; (b] lowpass model. 


valuable insight to bandpass phenomena by analogy with known lowpass relation- 
ships. We move back and forth between the bandpass and lowpass models with the 
help of our previous results for bandpass signals. 

In particular, after finding Y,,(f ) from Eq. (14), you can take its inverse Fourier 
transform 


yet) = FT Ye(f)] = F Hef )X ool F)] 


The lowpass-to-bandpass transformation in Eq. (12) then yields the output signal 
Yop(t). Or you can get the output quadrature components or envelope and phase 
immediately from y,,(t) as 


yi(t) = 2 Re [ye(t)] y(t) = 2 Im [Yet] [15] 
A(t) = 2 |yep(t)| by(t) = arg [ye(t)] 


which follow from Eq. (10). The example below illustrates an important application 
of these techniques. 





EXAMPLE 4.1-1 Carrier and Envelope Delay 


Consider a bandpass system having constant amplitude ratio but nonlinear phase 
shift 0( f) over its passband. Thus, 


Hue RETS 


and 


Ho he Cer Pe) ge fei i: 


as sketched in Fig. 4.1—6. Assuming the phase nonlinearities are relatively smooth, 
we can write the approximation 


Of + f.) = —2m(tof. + tf) 


where 


OF) 1 d) € 
2Tf. 2*7 df | ff 


This approximation comes from the first two terms of the Taylor series expansion of 


ef + fo). 


[I 
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Figure 4.1-6 


(a) Bandpass transfer Function; (b) lowpass equivalent. 


To interpret the parameters £9 and t,, let the input signal have zero phase so that 
Xpp(t) = A,(t) cos et and x¢,(t) = 5A,(t). If the input spectrum Xp (f) falls 
entirely within the system’s passband, then, from Eq. (14), 

Ye(f) = Ke PUR LC) = Ke Panis fett yy Cf) 
ET Ke "dece X Ure dom 


Recalling the time-delay theorem, we see that the second term corresponds to x,,(t) 
delayed by t,. Hence, 


Yeli) = Ke" "xot — 1) = Ke 3A (r— n) 
and Eq. (12) yields the bandpass output 


Yoplt) ~ KA,{t — 1) cos w(t — to) 


Based on this result, we conclude that fg is the carrier delay while 7, is the envelope 
delay of the system. And since f is independent of frequency, at least to the extent 
of our approximation for 6(f + f.), the envelope has not suffered delay distortion. 
Envelope delay is also called the group delay. 

We'll later describe multiplexing systems in which several bandpass signals at 
different carrier frequencies are transmitted over a single channel. Plots of d0/df 
versus f are used in this context to evaluate the channel’s delay characteristics. If the 


curve is not reasonably flat over a proposed band, phase equalization may be 
required to prevent excessive envelope distortion. 





Sides: a Baud dee system has zero piace shift b but LC = K + (KIF) (f - A 
for fe < f < fa where Ko > (K\/fO(fe — fo). Sketch He (f) taking fe < f. and f, > fo- 
Now show that if xp (t) = A,() cos œt then .the quadrature components of y,,(¢) are 
um K, dae) 
(t) = t E EE 
vil ) 0 x yal ) = 2mf, dt 
provided that X, (f) falls entirely within the bandpass of the system. 
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Tbe simplest bandpass system is the parallel resonant or tuned circuit repre- 


sented by Fig. 4.1—7a. The voltage transfer function plotted in Fig. 4.1—7b can be 
written as 


l 


[17al 
TUM 5) 
i Ee f 


in which the resonant frequency fy and quality factor Q are related to the element 


values by 
I C 
fo—— Q = AE 
j 2n VLC L 


The 3 dB bandwidth between the lower and upper cutoff frequencies is 


Q 
Since practical tuned circuits usually have 10 < Q < 100, the 3 dB bandwidth falls 
between 1 and 10 percent of the center-frequency value. 

A. complete bandpass system consists of the transmission channel plus tuned 
amplifiers and coupling devices connected at each end. Hence, the overall frequency 
response has a more complicated shape than that of a simple tuned circuit. Nonethe- 
less, various physical effects result in a loose but significant connection between the 
system's bandwidth and the carrier frequency f,—similar to Eq. (175). 

For instance, the antennas in a radio system produce considerable distortion 
unless the frequency range is small compared to f.. Moreover, designing a reason- 
ably distortionless bandpass amplifier turns out to be quite difficult if B is either 
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Figure 4.1-7 (a) Tuned circuit; (b) transfer function. 
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very large or very small compared to f.. As a rough rule of thumb, the fractional 
bandwidth B/f, should be kept within the range 


B 
0.01 < - < 0.1 [18] 


c 


Otherwise, the signal distortion may be beyond the scope of practical equalizers. 
From Eq. (18) we see that 





This observation is underscored by Table. 4.1-1, which lists selected carrier fre- 
quencies and the corresponding nominal bandwidth B = 0.02f, for different fre- 
quency bands. Larger bandwidths can be achieved, of course, but at substantially 
greater cost. As a further consequence of Eq. (18), the terms bandpass and narrow- 
band are virtually synonymous in signal transmission. 





Table 4.1-1 Selected carrier frequencies and nominal bandwidth 
Frequency Band Carrier Frequency Bandwidth 
Longwave radio 100 kHz 2 KHz 
Shortwave radio 5 MHz 100 kHz 
VHF 100 MHz 2 MHz 
Microwave 5GHz 100 MHz 
Millimeterwave 100 GHz 2 GHz 
Optical 5 x 10^ Hz 10? Hz 





Bandpass Pulse Transmission 
We found in Sect. 3.4 that transmitting a pulse of duration 7 requires a lowpass band- 
width B = 1/27. We also found in Example 2.3-2 that frequency translation converts 


a pulse to a bandpass waveform and doubles its spectral width. Putting these two 
observations together, we conclude that bandpass pulse transmission requires 


B zm lr 
Since Eq. (18) imposes the additional constraint 0.1f, > B, the carrier frequency 
must satisfy 

Jfa > 10/7 
These relations have long served as useful guidelines in radar work and related 


fields. To illustrate, if T = 1 us then bandpass transmission requires B = 1 MHz 
and f. > 10 MHz. 
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4.2 DOUBLE-SIDEBAND AMPLITUDE MODULATION 


There are two types of double-sideband amplitude modulation: standard amplitude 
modulation (AM), and suppressed-carrier double-sideband modulation (DSB). 
We'll examine both types and show that the minor tbeoretical difference between 
them has major repercussions in practical applications. 


AM Signals and Spectra 


The unique property of AM is that the envelope of the modulated carrier has the 
same shape as the message. If A, denotes the unmodulated carrier amplitude, mod- 
ulation by x(t) produces the modulated envelope 


A(t) = A1 + px(t)] D 


where u is a positive constant called the modulation index. The complete AM sig- 
nal x,(t) is then 


x,t) ^ A,[1 + px(t)] cos wet [2] 
= A, cos w,t + A, x(t) cos at 
Since x,(t) has no time-varying phase, its in-phase and quadrature components are 
xt) = AQ) xa(r) = 0 


as obtained from Eqs. (5) and (6), Sect. 4.1, with @(t) = 0. Actually, we should 
include a constant carrier phase shift to emphasize that the carrier and message 
come from independent and unsynchronized sources. However, putting a constant 
phase in Eq. (2) increases the notational complexity without adding to the physical 
understanding. 

Figure 4.2—1 shows part of a typical message and the resulting AM signal with 
two values of u. The envelope clearly reproduces the shape of x(1) if 


£O W psi [3] 


When these conditions are satisfied, the message x(t) is easily extracted from x,(t) 
by use of a simple envelope detector whose circuitry will be described in Sect. 4.5. 

The condition f, >> W ensures that the carrier oscillates rapidly compared to 
the time variation of x(t); otherwise, an envelope could not be visualized. The con- 
dition u = 1 ensures that A,[1 + ux(t)] does not go negative. With 100-percent 
modulation (u = 1), the envelope varies between Amin = 0 and Ay, = 2A,. Over- 
modulation (u > 1), causes phase reversals and envelope distortion illustrated 
by Fig. 4.2-1c. 

Going to the frequency domain, Fourier transformation of Eq. (2) yields 


XAf) = 3A - f) * AX — 5) F> a 


4.2  Double-Sideband Amplitude Modulation 


x(t) 


Amax = 4c (l + 1) 
Amin = Ac (1 - 2) 









Phase reversal 


| AM 
pl AAT t 
: ss 
(d 
d E igure 42-1 AM waveforms. {a} Message; (b] AM wave with u < 1; (c) ANA Weve with u > 1. 
l where we've written out only the positive-frequency half of X (f). The negative- 
frequency half will be the hermitian image of Eq. (4) since x,(¢) is a real bandpass 


signal. Both halves of X f) are sketched in Fig. 4.2-2 with X(f) from Fig. 4.1-1. 
The AM spectrum consists of carrier-frequency impulses and symmetrical side- 
bands centered at +f, The presence of upper and lower sidebands accounts for 
the name double-sideband amplitude modulation. It also accounts for the AM 


| transmission bandwidth 
Br = 2W (51 
4 Note that AM requires twice the bandwidth needed to transmit x(t) at baseband 
without modulation. 


Transmission bandwidth is an important consideration for the comparison of 
modulation systems. Another important consideration is the average transmitted 


E X power 
! Sp (x46) 
Upon expanding x¿(t) from Eq. (2), we have 

Sp = 3AN1 + 2ux(t) + ux)  2AN(1 + ux(t) f? cos 2o, t) 
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Figure 4.2-2 AM specirum. 


whose second term averages to zero under the condition f, => W. Thus, if 
(x(t)) = O and (x*(t)) = S, then 
Sr = 4A2(1 + p’S,) [6] 


The assumption that the message has zero average value (or no dc component) 
anticipates the conclusion from Sect. 4.5 that ordinary AM is not practical for trans- 
mitting signals with significant low-frequency content. 


We bring out the interpretation of Eq. (6) by putting it in the form 
Sr = P, + 2Py, 
where 
PQ—jADO— Pa = AD'S, = 3| S.P, Ul 
The term P, represents the unmodulated carrier power, since Sr = P, when 
L = 0; the term P,, represents the power per sideband since, when u # 0, Sy con- 


sists of the power in the carrier plus two symmetric sidebands. The modulation con- 
straint |ux(t)| = 1 requires that u25, = 1, so Py = 3P, and 


P. = Sp ~ 2P = 18, Py = }Sr [8] 


Consequently, at least 50 percent of the total transmitted power resides in a carrier 
term that's independent of x(t) and thus conveys no message information. 


DSB Signals and Spectra 


The “wasted” carrier power in amplitude modulation can be eliminated by setting 


= 1 and suppressing the unmodulated carrier-frequency component. The result- 
ing modulated wave becomes 


x(t) = A,x(t) cos w,t [9] 





4.2  Double-Sideband Amplitude Modulation 


which is called double-sideband-suppressed-carrier modulation—or DSB for 
short. (The abbreviations DSB-SC and DSSC are also used.) The transform of Eq. 
(9) is simply 


XAF) =7A-X(f-f) f»0 


and the DSB spectrum looks like an AM spectrum without the unmodulated carrier 
impulses. The transmission bandwidth thus remains unchanged at By = 2W. 
Although DSB and AM are quite similar in the frequency domain, the time- 


domain picture is another story. As illustrated by Fig. 4.2-3 the DSB envelope and 
phase are 


0 x(t) > 0 
+180° x(t) <0 


The envelope here takes the shape of |x(t)|, rather than x(t), and the modulated wave 
undergoes a phase reversal whenever x(t) crosses zero. Full recovery of the mes- 
sage requires knowledge of these phase reversals, and could not be accomplished by 
an envelope detector. Suppressed-carrier DSB thus involves more than just “ampli- 
tude” modulation and, as we’ll see in Sect. 4.5, calls for a more sophisticated 
demodulation process. 

However, carrier suppression does put all of the average transmitted power into 
the information-bearing sidebands. Thus 


A= AOL $= 4 


[10] 


Sr = 2P,, = 4A2S, | [11] 


which holds even when x(f) includes a dc component. From Eqs. (11) and (8) we see 
that DSB makes better use of the total average power available from a given trans- 
mitter. Practical transmitters also impose a limit on the peak envelope power A2... 


x(t) 





Phase reversal 


Figure 4.2-3 DSB waveforms. 
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We'll take account of this peak-power limitation by examining the ratio P,,/A vax 
under maximum modulation conditions. Using Eq. (11) with A max = A, for DSB 
and using Eq. (7) with A max = 2A, for AM, we find that 


nai ci radica ua fce t oot s 


4 
AN T DSB T 


PJA = 
so A max S./16 AMwithu=1 


ETE EAE TE, 
ee a as 


Hence, if A2,, is fixed and other factors are equal, a DSB transmitter produces four 
times the sideband power of an AM transmitter. 

The foregoing considerations suggest a trade-off between power efficiency and 
demodulation methods. 











Consider a radio transmitter rated for Sp = 3 kW and A2,, <= 8 kW. Let the modu- 
lating signal be a tone with A,, = 1 so S, = A2/2 = i. If the modulation is DSB, 
the maximum possible power per sideband equals the lesser of the two values deter- 


mined from Eqs. (11) and (12). Thus 


EXAMPLE 4.2-1 


Py =48pS15KW Po = GA2,, = 1.0kW 


which gives the upper limit P,, = 1.0 kW. 

If the modulation is AM with u = 1, then Eq. (12) requires that 
Py = A2,,/32 0.25 KW. To check on the average-power limitation, we note from 
Eq. (7) that Pa = P./4 so Sr = P, + 2Py = 6P,, and Po = S7/6 = 0.5 kW. 
Hence, the peak power limit again dominates and the maximum sideband power is 
P. = 0.25 kW. Since transmission range is proportional to Pp, the AM path length 
would be only 25 percent of the DSB path length with the same transmitter. : 


PLE to PE TOW ra laa a nu FSR SCATTER aaa D ab o ENE a ca STO EO ttd qa C Sov aE TT Da a i acr Tz 


EXERCISE 4.2-1 Let the modulating signal be a square wave that switches periodically between 
x(t) = +1 and x(t) = —1. Sketch x(t) when the modulation is AM with u = 0.5, 
AM with u = 1, and DSB. Indicate the envelopes by dashed lines. 





Suppose a voice signal has |x(r)| max = 1 and S, = 1/5. Calculate the values of Sy 
and AŽ „x needed to get P,, = 10 W for DSB and for AM with u = 1. 





EXERCISE 4.2-2 
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Figure 4.2-4 Line spectra for tone modulation. (a) DSB; (b] AM. 


Tone Modulation and Phasor Analysis 
Setting x(t) = Am cos w,,f in Eq. (9) gives the tone-modulated DSB waveform 
X(t) = A Am COS Wnt cos w,t [13a] 


A, A 
= TO cos (w, — «,,)t + — 





An 
j COS (c, + @,)t 


where we have used the trigonometric expansion for the product of cosines. Similar 
expansion of Eq. (2) yields the tone-modulated AM wave 








A-MAnm 
xt) = A, cos ext + j COS (a, — W_)t + cos (w, + @,_)t [13b] 


A-MAm 
2 
Figure 4.2-4 shows the positive-frequency line spectra obtained from Eqs. (13a) 

and (13D). 

It follows from Fig. 4.2-4 that tone-modulated DSB or AM can be viewed as a 
sum of ordinary phasors, one for each spectral line. This viewpoint prompts the use 
of phasor analysis to find the envelope-and-phase or quadrature-carrier terms. Pha- 
sor analysis is especially helpful for studying the effects of transmission distortion, 
interference, and so on, as demonstrated in the example below. 





Let's S cm fee case atte tone- Eme AM i we = § fore convenience. >. The pha: EXAMPLE 4.2-2 
sor diagram is constructed in Fig. 4.2—5a by adding the sideband phasors to the tip 
of the horizontal carrier phasor. Since the carrier frequency is f,, the sideband pha- 
sors at f. = f, rotate with speeds of +f, relative to the carrier phasor. The resultant 
of the sideband phasors is seen to be colinear with the carrier, and the phasor sum 
equals the envelope A1 + $ cos ct). 
But suppose a transmission channel completely removes the lower sideband, so 
we get the diagram in Fig. 4.2-5b. Now the envelope becomes 


A(t) = [(A, + $A, cos Omt)? + GA, sin Ont) 


= AVY + i COS ot 
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i A, sin oy 





(b) 
Figure 4.2-5 Phasor diagrams for Example 4.2-2. 


from which the envelope distortion can be determined. Also note that the transmis- 
sion amplitude distortion has produced a time-varying phase $(t). 





EXERCISE 4.2-3 Draw the phasor diagram for tone-modulated DSB with A,, = 1. Then find A(t) and 


p(t) when the amplitude of the lower sideband is cut in half. 





4.3 MODULATORS AND TRANSMITTERS 


The sidebands of an AM or DSB signal contain new frequencies that were not pres- 
ent in the carrier or message. The modulator must therefore be a time-varying or 
nonlinear system, because LTI systems never produce new frequency components. 
This section describes the operating principles of modulators and transmitters that 
employ product, square-law, or switching devices. Detailed circuit designs are given 
in the references cited in the Supplementary Reading. . 


Product Modulators 


Figure 4.3-1a is the block diagram of a product modulator for AM based on 
the equation x,(t) = A, cos w,t + ux(t)A, cos ct. The schematic diagram in 
Fig. 4.3-1b implements this modulator with an analog multiplier and an op-amp 
summer. Of course, a DSB product modulator needs only the multiplier to produce 
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Multiplier 


ux) —— (X) (E) x( 





A. COS Wnt 
{a) 
x(t) 
x(t) 
(b) 
Figure 4.3-1 (a) Product modulator for AM; {b} schematic diagram with analog multiplier. 

[ 
| Figure 4.3-2 Circuit for variable transconductance multiplier. 
i x(t) = x(t)A, cos wet. In either case, the crucial operation is multiplying two ana- 


log signals. 


Analog multiplication can be carried out electronically in a number of different 

| ways. One popular integrated-circuit design is the variable transconductance mul- 
tiplier illustrated by Fig. 4.3—2. Here, input voltage v, is applied to a differential 

amplifier whose gain depends on the transconductance of the transistors which, in 

[ turn, varies with the total emitter current. Input v; controls the emitter current by 
means of a voltage-to-current converter, so the differential output equals Ku;v;. 

| Other circuits achieve multiplication directly with Hall-effect devices, or indirectly 
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Nonlinear 
element 






x(t) Filter 


COS Wt 





(b) 


Figure 4.3-3 (a) Square-law modulator; (b) FET circuit realization. 


with log and antilog amplifiers arranged to produce antilog (log v, + log v;) = 
U;U,. However, most analog multipliers are limited to low power levels and rela- 
tively low frequencies. 


Square-Law and Balanced Modulators 

Signal multiplication at higher frequencies can be accomplished by the square-law 
modulator diagrammed in Fig. 4.3-3a. The circuit realization in Fig. 4.3-3 uses a 
field-effect transistor as the nonlinear element and a parallel RLC circuit as the fil- 
ter. We assume the nonlinear element approximates the square-law transfer curve 


= 2 
Vout = AVin + A2Vin 


Thus, with v,,(t) = x(t) + cos @,t, 


2a 
Vout) = ayx(t) + a5x?(t) + a, cos’w,t + afi + POI cosw,t [1] 
1 


The last term is the desired AM wave, with A, = a, and u = 2a,/a,, provided it 
can be separated from the rest. 

As to the feasibility of separation, Fig. 4.3-4 shows the spectrum 
Valf) = F[Vo (t)] taking X(f) as in Fig. 4.1-1. Note that the x?(t) term in Eq. (1) 
becomes X * X(f), which is bandlimited in 2W. Therefore, if f. > 3W, there is no 
spectral overlapping and the required separation can be accomplished by a bandpass 
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Figure 4.3-4 Spectral components in Eq. [1]. 








A, [1 ^ x(£)] cos w,t 
Y x(t) — —5- i 


A, COS wt 


( + )——— 7 x(DA, cos ext 


Figure 4.3-5 
Balanced 
modulator. 


— b x(t) ———— 
A, [1 — ^ x(£)] cos et 


filter of bandwidth By = 2W centered at f.. Also note that the carrier-frequency 
impulse disappears and we have a DSB wave if a, = 0—corresponding to the per- 
fect square-law curve Vou = a; v2. 

Unfortunately, perfect square-law devices are rare, so high-frequency DSB is 
obtained in practice using two AM modulators arranged in a balanced configuration 
to cancel out the carrier. Figure 4.3-5 shows such a balanced modulator in block- 
diagram form. Assuming the AM modulators are identical, save for the reversed sign 

of one input, the outputs are A,[1 + 3x(r)] cos w,t and A,[1 — $x(t)] cos w,t. Sub- 
. tracting one from the other yields x.(f) = x(t)A, cos w,t, as required. Hence, a bal- 
anced modulator is a multiplier. You should observe that if the message has a dc 
term, that component is not canceled out in the modulator, even though it appears at 
the carrier frequency in the modulated wave. 

Another modulator that is commonly used for generating DSB signals is the 
ring modulator shown in Fig. 4.3-6. A square-wave carrier c(t) with frequency f, 
causes the diodes to switch on and off. When c(t) > 0, the top and bottom diodes 
are switched on, while the two inner diodes in the cross-arm section are off. In this 
case, Vou = x(t). Conversely, when c(t) < 0, the inner diodes are switched on and 
the top and bottom diodes are off, resulting in Vow = —x(t). Functionally, the ring 
modulator can be thought of as multiplying x(t) and c(t). However because c(t) is a 
periodic function, it can be represented by a Fourier series expansion. Thus 


4 4 4 
Vout) = LX) cos w,t — 37) cos 3w,t + s. cos 5@ 4 =: 
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Figure 4.3-6 Ring modulator. 


Observe that the DSB signal can be obtained by passing v,,,(t) through a bandpass 
filter having bandwidth 2W centered at f,. This modulator is often referred to as a 
double-balanced modulator since it is balanced with respect to both x(t) and c(t). 
A balanced modulator using switching circuits is discussed in Chap. 6 under the 
heading of bipolar choppers. Other circuit realizations can be found in the literature. 





EXERCISE 4.3-1 Suppose the AM modulators in Fig. 4.3—5 are constructed with identical nonlinear 
elements having Uou = aiU; + a502, + au}. Take uj, = +x(t) + A, cos wt and 
show that the AM signals have second-harmonic distortion but, nonetheless, the 
final output is undistorted DSB. 





Switching Modulators 


In view of the heavy filtering required, square-law modulators are used primarily for 
low-level modulation, i.e., at power levels lower than the transmitted value. Sub- 
stantial linear amplification is then necessary to bring the power up to S7. But RF 
power amplifiers of the required linearity are not without problems of their own, and 
it often is better to employ high-level modulation if S; is to be large. 

Efficient high-level modulators are arranged so that undesired modulation prod- 
ucts never fully develop and need not be filtered out. This is usually accomplished 
with the aid of a switching device, whose detailed analysis is postponed to Chap. 6. 
However, the basic operation of the supply-voltage modulated class C amplifier is 
readily understood from its idealized equivalent circuit and waveforms in Fig. 4.3-7. 

The active device, typically a transistor, serves as a switch driven at the carrier 
frequency, closing briefly every 1/f. seconds. The RLC load, called a tank circuit, 
is tuned to resonate at f., so the switching action causes the tank circuit to “ring” 
sinusoidally. The steady-state load voltage in absence of modulation is then 
u(t) = V cos w, t. Adding the message to the supply voltage, say via transformer, 
gives u(t) = [V + Nx(t)] cos c, t, where N is the transformer turns ratio. If V and 
N are correctly proportioned, the desired modulation has been accomplished with- 
out appreciable generation of undesired components. 


Figure 4.3-7 


Figure 4.3-8 


4.3 Modulators and Transmitters 163 
— + 
o Tank 
Active device circuit 
1:N — mA à 
Je 
y= v(t) 


u(t) 








(b) 


Class C amplifier with supply-voltage modulation. (a) Equivalent circuit; (b) out 
put waveform. 
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AM transmitter with high-level modulation. 


A complete AM transmitter is diagrammed in Fig. 4.3—8 for the case of high- 
leve] modulation. The carrier wave is generated by a crystal-controlled oscillator to 
ensure stability of the carrier frequency. Because high-level modulation demands 
husky input signals, both the carrier and message are amplified before modulation. 
The modulated signal is then delivered directly to the antenna. 
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4.4  SUPPRESSED-SIDEBAND AMPLITUDE 
MODULATION 


Conventional amplitude modulation is wasteful of both transmission power and 
bandwidth. Suppressing the carrier reduces the transmission power. Suppressing 
one sideband, in whole or part, reduces transmission bandwidth and leads to single- 
sideband modulation (SSB) or vestigial-sideband modulation (VSB) discussed in 
this section. 


SSB Signals and Spectra 


The upper and lower sidebands of DSB are uniquely related by symmetry about 
the carrier frequency, so either one contains all the message information. Hence, 
transmission bandwidth can be cut in half if one sideband is suppressed along with 
the carrier. 

Figure 4.4-1a presents a conceptual approach to single-sideband modulation. 
Here, the DSB signal from a balanced modulator is applied to a sideband filter that 
suppresses one sideband. If the filter removes the lower sideband, the output spectrum 


| Bal | DSB | Sideband 
x(t) mod filter SSB 
bee 











cos wt 
(a) 
Xf) 
N USSB 
"m fe " 
b) tp ese 
XC) 
LSSB 
mis - L No f 
- f NG 0 fe- W fe 


r — w — 


Figure 4.4-1 Single-sideband modulation. (a) Modulator; (b) USSB spectrum; (c) 1558 
spectrum. 
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Xf) consists of the upper sideband alone, as illustrated by Fig. 4.4-1b. We'll label 
this a USSB spectrum to distinguish it from the LSSB spectrum containing just the 
lower sideband, as illustrated by Fig. 4.4—1c. The resulting signal in either case has 


Br=W Sp=P, = 1425, (1 


which follow directly from our DSB results. 

Although SSB is readily visualized in the frequency domain, the time-domain 
description is not immediately obvious—save for the special case of tone modula- 
tion. By referring back to the DSB line spectrum in Fig. 4.4—4a, we see that remov- 
ing one sideband line leaves only the other line. Hence, 


x(t) = $A,A, cos (o, + cy [2] 


in which the upper sign stands for USSB and the lower for LSSB, a convention 
employed hereafter. Note that the frequency of a tone-modulated SSB wave 1s offset 
from f, by +f, and the envelope is a constant proportional to A,,. Obviously, enve- 
lope detection won't work for SSB. 

To analyze SSB with an arbitrary message x(t), we'll draw upon the fact that the 
sideband filter in Fig. 4.4-la is a bandpass system with a bandpass DSB input 
Xp (t) = A_x(t) cos w,t and a bandpass SSB output y, (t) = x.(t). Hence, we'll 
find x,(t) by applying the equivalent lowpass method from Sect. 4.1. Since x(t) 
has no quadrature component, the lowpass equivalent input is simply 


Xep(t) a iA) X ef) = 5A X(f) 


The bandpass filter transfer function for USSB is plotted in Fig. 4.4—2a along with 
the equivalent lowpass function 


Hf) = Hy(f + f)u(f +f) = u(f) = u(f — W) 
The corresponding transfer functions for LSSB are plotted in Fig. 4.4-2b, where 
Hof) = u(f + W) — ulf) 
Both lowpass transfer functions can be represented by 
HAf)=20 +smf) |fisW [3] 


You should confirm for yourself that this rather strange expression does include both 
parts of Fig. 4.4—2. 


Multiplying H,,(f) and X4(f) yields the lowpass equivalent spectrum for 
either USSB or LSSB, namely 


Ylf) =4A(1 = sgnf)X(£) - iAQX() = (senf)X(f)] 


Now recall that (~j sgn f)X(f) = #[x(t)], where x(t) is the Hilbert transform of 
x(t) defined in Sect. 3.5. Therefore, 9* !((sgn f)X(f)] = jx(t) and 


yet) = aAdx(t) 3: j$(0] 
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Figure 4.4-2 Ideal sideband filters and lowpass equivalents. (a) USSB; (b) LSSB. 


Finally, we perform the lowpass-to-bandpass transformation x,(t) = y,,(t) = 
2 Re[ye(t)e/"^ to obtain 
x(t) = 4A [x(t) cos o, t * X(t) sin c, t] [4] 

This is our desired result for the SSB waveform in terms of an arbitrary message 
x(t). 

Closer examination reveals that Eq. (4) has the form of a quadrature-carrier 
expression. Hence, the in-phase and quadrature components are 

Xei(t) = 2Acx(t) xat) = 52A) 

while the SSB envelope is 


A(t) = 2A, V27(t) + 2°(2) [5] 


The complexity of Eqs. (4) and (5) makes it a difficult task to sketch SSB wave- 
forms or to determine the peak envelope power. Instead, we must infer time-domain 
properties from simplified cases such as tone modulation or pulse modulation. 


SSB with Pulse Modulation 


Whenever the SSB modulating signal has abrupt transitions, the Hilbert transform 
x(t) contains sharp peaks. These peaks then appear in the envelope A(t), giving rise 
to the effect known as envelope horns. To demonstrate this effect, let's take the rec- 
tangular pulse x(t) = u(t) — u(t — 7) so we can use X(t) found in Example 3.5-2. 
The resulting SSB envelope plotted in Fig. 4.4—3 exhibits infinite peaks at ¢ = O and 
t — 7, the instants when x(f) has stepwise discontinuities. Clearly, a transmitter 
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Horns 





Figure 4.4-3 Envelope of SSB with pulse modulation. 


couldn’t handle the peak envelope power needed for these infinite horns. Also note 
the smears in A(t) before and after each peak. 
We thus conclude that 








Show that Eqs. (4) and (5) agree with Eq. (2) when x(t) = Am cos w,, t so x(t) = EXERCISE 4.4-1 
Å m SIN Opn f. 





SSB Generation 


Our conceptual SSB generation system (Fig. 4.4—1a) calls for the ideal filter func- 
tions in Fig. 4.4—2. But a perfect cutoff at f = f, cannot be synthesized, so a real 
sideband filter will either pass a portion of the undesired sideband or attenuate a por- 
tion of the desired sideband. (Doing both is tantamount to vestigial-sideband modu- 
lation.) Fortunately, many modulating signals of practical interest have little or no 
low-frequency content, their spectra having “holes” at zero frequency as shown in 
Fig. 4.4—4a. Such spectra are typical of audio signals (voice and music), for exam- 
ple. After translation by the balanced modulator, the zero-frequency hole appears as 
a vacant space centered about the carrier frequency into which the transition region 
of a practical sideband filter can be fitted. Figure 4.4—4b illustrates this point. 

As a rule of thumb, the width 28 of the transition region cannot be much 
smaller than 1 percent of the nominal cutoff frequency, which imposes the limit 
| feo < 2008. Since 28 is constrained by the width of the spectral hole and f., should 

equal f., it may not be possible to obtain a sufficiently high carrier frequency with a 
given message spectrum. For these cases the modulation process can be carried out 
| in two (or more) steps using the system in Fig. 4.4—5 (see Prob. 4.4—5). 


| 
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Figure 4.4-4 (a) Message spectrum with zero-frequency hole; (b) practical sideband filter. 


x(t) 





Figure 4.4-5 Two-step SSB generation. 


Another method for SSB generation is based on writing Eq. (4) in the form 


Ac Aog 
x(t) = 5 X cos et + 5 Ai) cos (w,t — 90?) [6] 
This expression suggests that an SSB signal consists of two DSB waveforms with 
quadrature carriers and modulating signals x(t) and x(t). Figure 4.4-6 diagrams a 
system that implements Eq. (6) and produces either USSB or LSSB, depending 
upon the sign at the summer. This system, known as the phase-shift method, 
bypasses the need for sideband filters. Instead, the DSB sidebands are phased such 
that they cancel out on one side of f. and add on the other side to create a single- 
sideband output. l 

However, the quadrature phase shifter Ho(f ) is itself an unrealizable network 
that can only be approximated — usually with the help of additional but identical 
phase networks in both branches of Fig. 4.4—6. Approximation imperfections gener- 
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Figure 4.4-6 Phase-shift method for SSB generation. 
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Figure 4.4-7 Weaver's SSB modulator. 


ally cause low-frequency signal distortion, and the phase-shift system works best 
with message spectra of the type in Fig. 4.4—4a. A third method for SSB generation 


that avoids both sideband filters and quadrature phase shifters is considered in 
Example 4.4-2. 
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Weaver's SSB Modulator 


Consider the modulator in Fig. 4.4—7 taking x(t) = cos 27 fn t with fa < W. Then 
x(t) = v, + v, where v, is the signal from the upper part of the loop and v; is from 
the lower part. Taking these separately, the input to the upper LPF is 
cos 24r fn t COS 27 it The output of LPFI is multiplied by cos. 24r (f f, + By, 
resulting in v; = 4[ cos 2a(f, + ¥ — ¥ Ye) + cos 2r(f, + $+ $ — fatl. 
The input to the lower LPF is cos 277 n un Qa $t. D output of LPF? i is multi- 
plied by sin ale = + Py, resulting in v, = t[cos2r(f, + $- B + fin) 
i> cos Zar Ue +¥ E Taking the upper signs x,(t) = 2 X } cos 
2m(f. d , pe + cos (œ, + @m)t, which corresponds to USSB. Similarly, 


we achieve LSSB by taking the lower signs, resulting in x,(t) = $ cos (@, — c)t 
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Take x(t) = cos wt in Fig. 4.4—6 and confirm the sideband cancellation by sketch- 
ing line spectra at appropriate points. 





VSB Signals and Spectra* 


Consider a modulating signal of very large bandwidth having significant low- 
frequency content. Principal examples are television video, facsimile, and high-speed 
data signals. Bandwidth conservation argues for the use of SSB, but practical SSB 
systems have poor low-frequency response. On the other hand, DSB works quite well 
for low message frequencies but the transmission bandwidth is twice that of SSB. 
Clearly, a compromise modulation scheme is desired; that compromise is VSB. 

VSB is derived by filtering DSB (or AM) in such a fashion that one sideband is 
passed almost completely while just a trace, or vestige, of the other sideband is 
included. The key to VSB is the sideband filter, a typical transfer function being that 
of Fig. 4.4—8a. While the exact shape of the response is not crucial, it must have odd 
symmetry about the carrier frequency and a relative response of 1/2 at f,. Therefore, 
taking the upper sideband case, we have 


Af) = ulf- f) Af -f) fro [7a] 
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Figure 4.4-8 VSB filter characteristics. 
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where 


H-f) -Hf) and  Hyf)0  |f|^»B [7b] 
as shown in Fig. 4.4-8b. 
The VSB filter is thus a practical sideband filter with transition width 28. 


Because the width of the partial sideband is one-half the filter transition width, the 
transmission bandwidth is 


BQ,-Wc-Bew [8] 


However, in some applications the vestigial filter symmetry 1s achieved primarily at 
the receiver, so the transmission bandwidth must be slightly larger than W + f. 

When B << W, which is usually true, the VSB spectrum looks essentially like 
an SSB spectrum. The similarity also holds in the time domain, and a VSB wave- 
form can be expressed as a modification of Eq. (4). Specifically, 


x(t) = $A[x(f) cos wt — x(t) sin c, t] [9a] 
where x,(t) is the quadrature message component defined by 


x(t) = X(t) + xg(t) [95] 
with 


B 
X(t) = 32 Hef )X(f )e?" df [9c] 
-B 


If 8 < W, VSB approximates SSB and x,(t) ~ 0; conversely, for large B, VSB 
approximates DSB and x(t) + x,(t) = 0. The transmitted power 57 is not easy to 
determine exactly, but is bounded by 


FA2S, S S7 SHAS, [10] 


depending on the vestige width f. 

Finally, suppose an AM wave is applied to a vestigial sideband filter. This mod- 
ulation scheme, termed VSB plus carrier (VSB + C), is used for television video 
transmission. The unsuppressed carrier allows for envelope detection, as.in AM, 
while retaining the bandwidth conservation of suppressed sideband. Distortionless 
envelope modulation actually requires symmetric sidebands, but VSB + C can 
deliver a fair approximation. 

To analyze the envelope of VSB + C, we incorporate a carrier term and modu- 
lation index u in Eq. (9) which becomes 


x(t) = A,{[1 + wx(t)] cos ext — uxj(t) sin etj [11] 
The in-phase and quadrature components are then 
x«t) = Ad + px(t) | xm) x A, xt) 


so the envelope is A(t) = [x2,(t) + x2,(7)]? or 
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A(t) = A19 ZORE + ae yt [12] 


Hence, if u is not too large and f not too small, then | x,(t)| « land 
A(t) ~A{1 + ua] 


as desired. Empirical studies with typical signals are needed to find values for u and 
B that provide a suitable compromise between the conflicting requirements of dis- 
tortionless envelope modulation, power efficiency, and bandwidth conservation. 





4.5 FREQUENCY CONVERSION AND DEMODULATION 


Linear CW modulation—be it AM, DSB, SSB, or VSB—produces upward transla- 
tion of the message spectrum. Demodulation therefore implies downward fre- 
quency translation in order to recover the message from the modulated wave. 
Demodulators that perform this operation fall into the two broad categories of syn- 
chronous detectors and envelope detectors. 

Frequency translation, or conversion, is also used to shift a modulated signal to 
a new carrier frequency (up or down) for amplification or other processing. Thus, 
translation is a fundamental concept in linear modulation systems and includes 
modulation and detection as special cases. Before examining detectors, we’ll look 
briefly at the general process of frequency conversion. 


Frequency Conversion 


Frequency conversion starts with multiplication by a sinusoid. Consider, for exam- 
ple, the DSB wave x(t) cos ot. Multiplying by cos œt, we get 


x(t) cos ct cos œt = 3x(t) cos (ve, + @2)t + 3x(f) cos (o, — œt H 


The product consists of the sum and difference frequencies, f, + f; and |f, — f], 
each modulated by x(t). We write |f, — f;| for clarity, since cos (w, — @,)i = 
cos (o, — w,)t. Assuming f; 7 fı, multiplication has translated the signal spectra to 
two new carrier frequencies. With appropriate filtering, the signal is up-converted or 
down-converted. Devices that carry out this operation are called frequency con- 
verters or mixers. The operation itself is termed heterodyning or mixing. 

Figure 4.5—1 diagrams the essential components of a frequency converter. 
Implementation of the multipiier follows the same line as the modulator circuits dis- 
cussed in Sect. 4.3. Converter applications include beat-frequency oscillators, 
regenerative frequency dividers, speech scramblers, and spectrum analyzers, in 
addition to their roles in transmitters and receivers. 
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EXAMPLE 4.5-1 Figure 4.5-2 represents a simplified transponder in a satellite relay that provides 
two-way communication between two ground stations. Different carrier frequencies, 
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Multiplier 
Oscillator 
Figure 4.5-1 Frequency converter. 
6 GHz | 2 GHz | 4 GHz 
AA NNN 
4 GHz 7 6 GHz 
Figure 4.5-2 Satellite transponder with frequency conversion. 


6 GHz and 4 GHz, are used on the uplink and downlink to prevent self-oscillation due 
to positive feedback from the transmitting side to the receiving side. A frequency 


converter translates the spectrum of the amplified uplink signal to the passband of the 
downlink amplifier. 





EXERCISE 4.5-1 


Sketch the spectrum of Eq. (1) for f < fi, f; = fj, and f, > ff, taking X(f) as in Fig. 
4.1-1. 





Synchronous Detection 


All types of linear modulation can be detected by the product demodulator of 
Fig. 4.5-3. The incoming signal is first multiplied with a locally generated sinusoid 
and then lowpass-filtered, the filter bandwidth being the same as the message band- 
width W or somewhat larger. It is assumed that the local oscillator (LO) is exactly 
synchronized with the carrier, in both phase and frequency, accounting for the name 
synchronous or coherent detection. 


For purposes of analysis, we'll write the input signal in the generalized form 
x(t) = EUER, x(t)] cos wt ~ K, x(t) sin wet [2] 


which can represent any type of linear modulation with proper identification of 
Ka K,, and x,(t)—ic., take K, = 0 for suppressed carrier, x,(t) = 0 for double 
sideband, and so on. The filter input is thus the product 
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A19 COS wf 


Figure 4.5-3 Synchronous product detection. 


x(t)Aro cos eot 
A 
=F ((K + K,x(¢)] + [K. + Ky 2(1)] cos 2o,t — K, x(t) sin 20t} 


Since f, > W, the double-frequency terms are rejected by the lowpass filter, leaving 
only the leading term 


yp(t) = Kp[K, + Ka x(t)] [3] 


where Kp is the detection constant. The DC component Kp K. corresponds to the 
translated carrier if present in the modulated wave. This can be removed from the 
output by a blocking capacitor or transformer—which also removes any DC term in 
x(t) as well. With this minor qualification we can say that the message has been fully 
recovered from x,(t). 

Although perfectly correct, the above manipulations fail to bring out what goes 
on in the demodulation of VSB. This is best seen in the frequency domain with the 
message spectrum taken to be constant over W (Fig. 4.5—4a) so the modulated spec- 
trum takes the form of Fig. 4.5—4b. The downward-translated spectrum at the filter 
input will then be as shown in Fig. 4.5—4c. Again, high-frequency terms are elimi- 
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Figure 4.5-4 VSB spectra. (a) Message; (b] modulated signal; (c] frequency-translated signal 
before lowpass filtering. 





4.5 Frequency Conversion and Demodulation 


x,(t) + pilot carrier 


(x) LPF 





Figure 4.5-5 Homodyne detection. 


nated by filtering, while the down-converted sidebands overlap around zero fre- 
quency. Recalling the symmetry property of the vestigial filter, we find that the por- 
tion removed from the upper sideband is exactly restored by the corresponding ves- 
tige of the lower sideband, so X(f) has been reconstructed at the output and the 
detected signal is proportional to x(7). 

Theoretically, product demodulation borders on the trivial; in practice, it can be 
rather tricky. The crux of the problem is synchronization—synchronizing an oscil- 
lator to a sinusoid that is not even present in the incoming signal if carrier is sup- 
pressed. To facilitate the matter, suppressed-carrier systems may have a small 
amount of carrier reinserted in x(t) at the transmitter. This pilot carrier is picked 
off at the receiver by a narrow bandpass filter, amplified, and used in place of an LO. 
The system, shown in Fig. 4.5—5, is called homodyne detection. (Actually, the 
amplified pilot more often serves to synchronize a separate oscillator rather than 
being used directly.) 

A variety of other techniques are possible for synchronization, including 
phase-lock loops (to be covered in Sect. 7.3) or the use of highly stable, crystal- 
controlled oscillators at transmitter and receiver. Nonetheless, some degree of asyn- 
chronism must be expected in synchronous detectors. It is therefore important to 
investigate the effects of phase and frequency drift in various applications. This 
we'll do for DSB and SSB in terms of tone modulation. 

Let the local oscillator wave be cos (w,t + w't + $'), where w’ and ¢’ repre- 
sent slowly drifting frequency and phase errors compared to the carrier. For double 
sideband with tone modulation, the detected signal becomes 


yp(t) = Kp cos ot cos (c't + $') [4] 


» ! 
a [cos (wm + w')t + cos (o, — w’)t] ¢' =0 


Kp cos w,,t cos $' w = 0 
Similarly, for single sideband with x(t) = cos (w, + ,,)t, we get 
yolt) = Kp cos [ost = (w't + 6')] [5] 


- D cos(m, F w')t  $'-0 
Kp cos (mt + $') o'-0 
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All of the foregoing expressions come from simple trigonometric expansions. 

Clearly, in both DSB and SSB, a frequency drift that's not small compared to W 
will substantially alter the detected tone. The effect is more severe in DSB since a 
pair of tones, fa + f' and f, — f', is produced. If f’ < f, this sounds like war- 
bling or the beat note heard when two musical instruments play in unison but 
slightly out of tune. While only one tone is produced with SSB, this too can be dis- 
turbing, particularly for music transmission. To illustrate, the major triad chord con- 
sists of three notes whose frequencies are related as the integers 4, 5, and 6. Fre- 
quency error in detection shifts each note by the same absolute amount, destroying 
the harmonic relationship and giving the music an East Asian flavor. (Note that the 
effect is not like playing recorded music at the wrong speed, which preserves the 
frequency ratios.) For voice transmission, subjective listener tests have shown that 
frequency drifts of less than +10 Hz are tolerable, otherwise, everyone sounds 
rather like Donald Duck. 

As to phase drift, again DSB is more sensitive, for if 6’ = 90? (LO and car- 
rier in quadrature), the detected signal vanishes entirely. With slowly varying ¢’, we 
get an apparent fading effect. Phase drift in SSB appears as delay distortion, the 
extreme case being when $' = +90° and the demodulated signal becomes X(t). 
However, as was remarked before, the human ear can tolerate sizeable delay distor- 
tion, so phase drift is not so serious in voice-signal SSB systems. 

To summarize, 





Envelope Detection 


Very little was said earlier in Sect. 4.5 about synchronous demodulation of AM for 
the simple reason that it's almost never used. True, synchronous detectors work for 
AM, but so does an envelope detector, which is much simpler. Because the enve- 
lope of an AM wave has the same shape as the message, independent of carrier fre- 
quency and phase, demodulation can be accomplished by extracting the envelope 
with no worries about synchronization. 

A simplified envelope detector and its waveforms are shown in Fig. 4. Ra 
where the diode is assumed to be piecewise-linear. In absence of further circuitry, 
the voltage v would be just the half-rectified version of the input via But R,C; acts 
as a lowpass filter, responding only to variations in the peaks of v;, provided that 


l 
y < — — «x 6] 
y RC, JY. [ 
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4.5 Frequency Conversion and Demodulation 


U 








Uout 


(b) 


Figure 4.5-6 Envelope detection {a} Circuit; (b) waveforms. 


Thus, as noted earlier, we need f. >> W so the envelope is clearly defined. Under 
these conditions, C, discharges only slightly between carrier peaks, and v approxi- 
mates the envelope of Via More sophisticated filtering produces further improve- 
ment if needed. Finally, R,C, acts as a dc block to remove the bias of the unmodu- 
lated carrier component. Since the dc block distorts low-frequency message 
components, conventional envelope detectors are inadequate for signals with impor- 
tant low-frequency content. 

The voltage v may also be filtered to remove the envelope variations and pro- 
duce a dc voltage proportional to tbe carrier amplitude. This voltage in turn is fed 
back to earlier stages of the receiver for automatic volume control (AVC) to com- 
pensate for fading. Despite the nonlinear element, Fig. 4.5-6 is termed a linear 
envelope detector; the output is linearly proportional to the input envelope. Power- 
law diodes can also be used, but then v will include terms of the form v2, v?,, and so 
on, and there may be appreciable second-harmonic distortion unless u << 1. 

Some DSB and SSB demodulators employ the method of envelope recon- 
struction diagrammed in Fig. 4.5—7. The addition of a large, locally generated car- 
rier to the incoming signal reconstructs the envelope for recovery by an envelope 
detector. This method eliminates signal multiplication but does not get around the 
synchronization problem, for the local carrier must be as well synchronized as the 
LO in a product demodulator. 
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Figure 4.5-7 Envelope reconstruction for suppressed-carrier modulation. 





Let the input in Fig. 4.5—7 be SSB with tone modulation, and let the LO have a 
phase error $' but no frequency error. Use a phasor diagram to obtain an expression 
for the resulting envelope. Then show that A(t) ~ Aro + 3A, Am cos (w,t = d$) 
if Aio © AL Åm 


EXERCISE 4.5-2 











4.6 PROBLEMS 


4.1-1 Use a phasor diagram to obtain expressions for u,(t),u,(t), A(t), and (t) when 
Up (f) = u;(t) cos w,t + v(t) cos (vt + a). Then simplify A(t) and P(r) assum- 
ing |] < |vi(r).. 

4.1-2 Do Prob. 4.1-1 with vj, (t) = v(t) cos (v, — wo)t + vX(t) cos (w, + wo)t. 


4.1-3 Let v(t) and v,(t) in Eq. (7) be lowpass signals with energy E; and E}, respectively, 
and bandwidth W < f.. (a) Use Eq. (17), Sect. 2.2, to prove that 


| Usp tat = 0 
(b) Now show that the bandpass signal energy equals (E; + E,)/2. 
4.1-4* Find ve (t), u{t) and v (t) when f, = 1200 Hz and 


1 900 = |f| < 1300 
V = 
ool) lo otherwise 
4.1-5 Do Prob. 4.1-4 with 
1 1100 = |f| < 1200 
Valf) =41/2 1200 x |f| < 1350 
0 otherwise 


4.1-6 Let u,,(t) = 2z(t) cos [(@, + wo)t + a]. Find v(t) and v,(t) to obtain 
valt) = 2(t) exp j(teot + a) 


4.1-7 Derive Eq. (17b) by obtaining expressions for fy and f, from Eq. (17a). 





4.1-8 


4.1-9 


4.1-10* 


4.1-11 


4.1-12t 
4.1-13t 


4.2-1 
4.2-2 
4.2-3* 
4.2-4 
4.2-5 
4.2-6 


4.2-7 
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Let f = (1 + ô)f in Eq. (17a) and assume that |8| << 1. Derive the handy approx- 
imation 


H(f) = VI + j2Q0 — f)/fo] 
which holds for f > O and |f — p| < fy. 


A stagger-tuned bandpass system centered at f = f, has H(f) = 2Hi(f)HXf). 
where H,(f ) is given by Eq. (17a) with f; = f. — b and Q = fy/2b while H;(f) is 
given by Eq. (17a) with fy = f. + b and Q = f,/2b. Use the approximation in Prob. 
4.1-8 to plot |H(f)| for f, — 2b < f < f, + 2b and compare it with a simple tuned 
circuit having f; = f. and B = 2bv/2. 

Use lowpass time-domain analysis to find and sketch y,,(t) when x,,(t) = 


A cos ot u(t) and H,(f) = 1/[1 + j2(f — f.)/B] for f > 0, which corresponds 
to the tuned-circuit approximation in Prob. 4.1-8. 


Do Prob. 4.1-10 with H,,(f) = II[(f£ — f.)/Ble 7" for f > 0, which corresponds 
to an ideal BPF. Hint: See Eq. (9), Sect. 3.4. 


The bandpass signal in Prob. 4.1-6 has z(t) = 2u(t) and is applied to an ideal 
BPF with unit gain, zero time delay, and bandwidth B centered at f.. Use lowpass 


frequency-domain analysis to obtain an approximation for the bandpass output sig- 
nal when B << fo. 


Consider a BPF with bandwidth B centered at f., unit gain, and parabolic phase shift 
O(f) = (f — f. Y/b for f > 0. Obtain a quadrature-carrier approximation for the 
output signal when |b| >> (B/2)? and x, (t) = z(t) cos w,t, where z(t) has a band- 
limited lowpass spectrum with W = 8. 


Let x(t) = cos 27f,t u(t) with f, « f. Sketch x,(t) and indicate the envelope 


when the modulation is AM with u < 1, AM with u > 1, and DSB. Identify loca- 
tions where any phase reversals occur. 


Do Prob. 4.2-1 with x(t) = O.5u(t) — 1.5u(t — T) with T > 1/f.. 
If x(t) = cos 20071, find B; and S; for the AM modulated signal assuming A, = 10 
and u = 0.6. Repeat for DSB transmission. 


The signal x(t) = sinc? 40f is to be transmitted using AM with u < 1. Sketch the 
double-sided spectrum of x,(t) and find By. 


Calculate the transmitted power of an AM wave with 100 percent tone modulation 
and peak envelope power 32 kW. 


Consider a radio transmitter rated for 4 kW peak envelope power. Find the maxi- 
mum allowable value of u for AM with tone modulation and Sy = 1 kW. 


The multitone modulating signal x(t) = 3K(cos 871 + 2 cos 2051) is input to an 
AM transmitter with u = 1 and f, = 1000. Find K so that x(t) is properly normal- 
ized, draw the positive-frequency line spectrum of the modulated wave, and calcu- 
late the upper bound on 2P,/;. 
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Do Prob. 4.2-7 with x(t) = 2K(cos 87 + 1) cos 20r. 


The signal x(t) = 4 sin 51 is transmitted by DSB. What range of carrier frequencies 
can be used? 


The signal in Prob. 4.2-9 is transmitted by AM with u = 1. Draw the phasor dia- 


gram. What is the minimum amplitude of the carrier such that phase reversals don't 
occur? 


The signal x(t) = cos 27401 + 3 cos 24790t is transmitted using DSB. Sketch the 
MEM line spectrum and the phasor diagram. 


The signal x(t) = 3 cos 2070t + $ cos 21120: is input to the square-law modulator 
system given in Fig. 4.3—3a (p. 160) with a carrier frequency of 10 kHz. Assume 
Vout = diUjg + "d (a) Give the center frequency and bandwidth of the filter such 
that this system will produce a Standard AM signal. (b) Determine values of à; and 
a, such that A, = 10 and u = 4 


A modulation system with nonlinear elements produces the signal x,(t) = 
aK*(v(t) + A cos wt)? — b(v(t) — A cos «,t)?. If the carrier has frequency f, and 
u(t) = x(t), show that an appropriate choice of K produces DSB modulation with- 
out filtering. Draw a block diagram of the modulation system. 


Find K and v(t) so that the modulation system from Prob. 4.3-2 produces AM with- 
out filtering. Draw a block diagram of the modulation system. 


A modulator similar to the one in Fig. 4.3—3a (p. 160) has a nonlinear element of the 
form Von = aiU; + 2303, Sketch V ulf) for the input signal in Fig. 4.1-1 (p. 143). 
Find the parameters of the oscillator and BPF to produce a DSB signal with carrier 
frequency f,. 


Design in block-diagram form an AM modulator using the nonlinear element from 
Prob. 4.3-4 and a frequency doubler. Carefully label all components and find a 
required condition on f, in terms of W to realize this system. 


Find the output signal in Fig. 4.3—5 (p. 161) when the AM modulators are unbal- 
anced, so that one nonlinear element has Vow = a,U,, + aU, + avi, while the 
other has uy, = b,u,, + bjv2, + bv}. 


The signal x(t) = 20sinc? 400: is input to the ring modulator in Fig. 4.3-6 (p. 162). 
Sketch the spectrum of Vouw and find the range of values of f. that can be used to 
transmit this signal. 


Derive Eq. (4) from ye,(t). 
Take the transform of Eq. (4) to obtain the SSB spectrum 


Xf) =aA-f[1 = sgn(f — £2]X( — £) + [1 7 sgnüf - £)1XC  /2)- 


Confirm that the expression for X( f) in Prob. 4.4—2 agrees with Figs. 4.4-1b and 
4.4—1c (p. 164). 
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Find the SSB envelope when x(t) = cos œt + $ cos 3o,t which approximates a 
triangular wave. Sketch A(r) taking A, = 81 and compare with x(t). 


The system in Fig. 4.4-5 produces USSB with f. = f, + fa when the lower cutoff 
frequency of the first BPF equals f, and the lower cutoff frequency of the second 
BPF equals f. Demonstrate the system's operation by taking X(f) as in Fig. 4.4—4a 
and sketching spectra at appropriate points. How should the system be modified to 
produce LSSB? 


Suppose the system in Fig. 4.4—5 is designed for USSB as described in Prob. 4.4—5. 
Let x(f) be a typical voice signal, so X(f) has negligible content outside 
200 < |f| < 3200 Hz. Sketch the spectra at appropriate points to find the maxi- 
mum permitted value of f. when the transition regions of the BPFs must satisfy 
28 = 0.01f,,. 


The signal x(t) = cos 27100 + 3 cos 27200 + 2 cos 27400t is input to an 
LSSB amplitude modulation system with a carrier frequency of 10 kHz. Sketch the 
double-sided spectrum of the transmitted signal. Find the transmitted power $7 and 
bandwidth By. 


Draw the block diagram of a system that would generate the LSSB signal in Prob. 
4.4—7, giving exact values for filter cutoff frequencies and oscillators. Make sure 
your filters meet the fractional bandwidth rule. l 


Suppose the carrier phase shift in Fig. 4.4—6 is actually —90? + ô, where ô is a 
small angular error. Obtain approximate expressions for x,(t) and A(z) at the output. 


Obtain an approximate expression for x,{t) at the output in Fig. 4.4-6 when 
x(t) = cos, and the quadrature phase shifter has |Ho(f,,)| = 1 — € and 
arg Ho( fn) = —90° + ô, where e and 6 are small errors. Write your answer as a 
sum of two sinusoids. 


The tone signal x(t) = A,, cos 27f,,¢ is input to a VSB + C modulator. The result- 
ing transmitted signal is 
xt) = A, cos 2r ft + $aA,A, cos [2v(f. + f.)t] 
+ 3(1 — a)A,A, cos [2n(f. — f.)t]. 
Sketch the phasor diagram assuming a > 1. Find the quadrature component 3o): 


Obtain an expression for VSB with tone modulation taking f, « B so the VSB fil- 


ter has H(f, = fa) = 0.5 + a. Then show that x,(t) reduces to DSB when a = 0 
or SSB when a = £0.5. 


Obtain an expression for VSB + C with tone modulation taking fa > f. Construct 
the phasor diagram and find A(t). 


Given a bandpass amplifier centered at 66 MHz, design a television transponder that 
receives a signal on Channel 11 (199.25 MHz) and transmits it on Channel 4 (67.25 
MHz). Use only one oscillator. 


182 


4.5-2 


4,5-3 


4.5-4 


4.5-5* 


4.5-6 


4.5-7 


4.5-8 


CHAPTER 4 € Linear CW Modulation 


Do Prob. 4.5-1 with the received signal on Channel 44 (651.25 MHz) and the trans- 
mitted signal on Channel 22 (519.25 MHz). 


The system in Fig. 4.4—5 becomes a scrambler when the first BPF passes only the 
upper sideband, the second oscillator frequency is f; = f| + W, and the second BPF 
is replaced by an LPF with B = W. Sketch the output spectrum taking X( f) as in 
Fig. 4.4-4a, and explain why this output would be unintelligible when x(t) is a voice 
signal. How can the output signal be unscrambled? 


Take x(t) as in Eq. (2) and find the output of a synchronous detector whose local 
oscillator produces 2 cos (cot + p), where ¢ is a constant phase error. Then write 
separate answers for AM, DSB, SSB, and VSB by appropriate substitution of the 
modulation parameters. 


The transmitted signal in Prob. 4.4—11 is demodulated using envelope detection. 
Assuming 0 = a = 1, what values of a minimize and maximize the distortion at the 
output of the envelope detector? 


The signal x(t) = 2 cos 47t is transmitted by DSB. Sketch the output signal if 
envelope detection is used for demodulation. 


Suppose the DSB waveform from Prob. 4.5—6 is demodulated using a synchronous 
detector that has a square wave with a fundamental frequency of f. as the local oscil- 
lator. Will the detector properly demodulate the signal? Will the same be true if peri- 
odic signals other than the square wave are substituted for the oscillator? 


Sketch a half-rectified AM wave having tone modulation with vA, = 1 and 
tm = W. Use your sketch to determine upper and lower limits on the time constant 
RC, of the envelope detector in Fig. 4.5—6. From these limits find the minimum 
practical value of f,/W. 
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wo properties of linear CW modulation bear repetition at the outset of this chapter: the modulated spectrum is 

basically the translated message spectrum and the transmission bandwidth never exceeds twice the message 
bandwidth. A third property, derived in Chap. 10, is that the destination signaHo-noise ratio (S/N)}p is no better than 
baseband transmission and can be improved only by increasing the transmitted power. Exponential modulation 
ditfers on all three counts. 

In contrast to linear modulation, exponential modulation is a nonlinear process; therefore, it should come as no 
surprise that the modulated spectrum is not related in a simple fashion to the message spectrum. Moreover, it turns out 
that the transmission bandwidth is usually much greater than twice the message bandwidth. Compensating for the 
bandwidth liability is the fact that exponential modulation can provide increased signalto-noise ratios without 
increased transmitted power. Exponential modulation thus allows you to trade bandwidth for power in the design of 
a communication system. 

We begin our study of exponential modulation by defining the two basic types, phase modulation (PM) and 
frequency modulation (FM). We'll examine signals and spectra, investigate the transmission bandwidth and distor 
tion problem, and describe typical hardware for generation and detection. The analysis of interference at the end of the 
chapter brings out the value of FM for radio broadcasting and sets the stage for our consideration of noise in Chap. 10. 


OBJECTIVES 


After studying this chapter and working the exercises, you should be able to do each of the following: 


Find the instantaneous phase and frequency of a signal with exponential modulation (Sect. 5.1). 

Construct the line spectrum and phasor diagram for FM or PM with tone modulation (Sect. 5.1). 

Estimate the bandwidth required for FM or PM transmission (Sect. 5.2). 

Identify the effects of distortion, limiting, and frequency multiplication on an FM or PM signal (Sect. 5.2). 
Design an FM generator and detector appropriate for an application (Sect. 5.3). 

Use a phasor diagram to analyze interference in AM, FM, and PM (Sect. 5.4). 


a A Perte Dac 


5.1 PHASE AND FREQUENCY MODULATION 


This section introduces the concepts of instantaneous phase and frequency for the 
definition of PM and FM signals. Then, since the nonlinear nature of exponential 
modulation precludes spectral analysis in general terms, we must work instead with 


the spectra resulting from particular cases such as narrowband modulation and tone 
modulation. 


PM and FM Signals 


Consider a CW signal with constant envelope but time-varying phase, so 


x(t) = A,cos[w,t + (t)] [n 





5.1 Phase and Frequency Modulation 


Upon defining the total instantaneous angle 


0t) = wt + olt) 


we can express x,(t) as 
x(t) = A,cos 6,(t) = A, Re [e/99] 


Hence, if 0,(t) contains the message information x(t), we have a process that may be 
termed either angle modulation or exponential modulation. We'll use the latter 
name because it emphasizes the nonlinear relationship between x,(t) and x(t). 


As to the specific dependence of 6,(t) on x(t, phase modulation (PM) is 
defined by 


b(t) = dax(t) bg = 180° [2] 
so that 


x(t) = A, cos [ot + dyx(t)] [3] 


These equations state that the instantaneous phase varies directly with the modulat- 
ing signal. The constant @, represents the maximum phase shift produced by x(t), 
since we're still keeping our normalization convention |x(#)| 1. The upper bound 
pa = 180° (or r radians) limits (7) to the range — 180? and prevents phase ambi- 
guities—after all, there's no physical distinction between angles of +270 and —90°, 
for instance. The bound on $4 is analogous to the restriction u = 1 in AM, and $4 
can justly be called the phase modulation index, or the phase deviation. 

The rotating-phasor diagram in Fig. 5.1—1 helps interpret phase modulation and 
leads to the definition of frequency modulation. The total angle 0,(t) consists of the 
constant rotational term w,t plus (t), which corresponds to angular shifts relative to 


the dashed line. Consequently, the phasor's instantaneous rate of rotation in cycles 
per second will be 





E Lm dos 
fupe BU ego e [4] 








Figure 5.1-1 Rotating-phasor representation of exponential modulation. 
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in which the dot notation stands for the time derivative, that is, f(t) = do(t)/dt, and 
so on. We call f(t) the instantaneous frequency of x,(t). Although f(t) is measured 
in hertz, it should not be equated with spectral frequency. Spectral frequency fis the 
independent variable of the frequency domain, whereas instantaneous frequency f(t) 
is a time-dependent property of waveforms with exponential modulation. 

In the case of frequency modulation (FM), the instantaneous frequency of the 
modulated wave is defined to be 


f0S5£5-fwD IR [5] 


so f(t) varies in proportion with the modulating signal. The proportionality constant 
fa, called the frequency deviation, represents the maximum shift of f(¢) relative to 
the carrier frequency f,. The upper bound f, < f, simply ensures that f(t) > 0. How- 
ever, we usually want fa << f. in order to preserve the bandpass nature of x (7). 

Equations (4) and (5) show that an FM wave has (1) = 27f, x(t), and integra- 
tion yields the phase modulation 


(t) = 2nfs| x(A) dA + olto) t = ty [6a] 


If fp is taken such that (t) = 0, we can drop the lower limit of integration and use 
the informal expression 


t 


b(t) = 2nf. | x(A) dà [6b] 


The FM waveform is then written as 


x(t) = A, cos |o. + 25, | x) aa [7] 


But it must be assumed that the message has no de component so the above integrals 
do not diverge when t — co. Physically, a dc term in x(t) would produce a constant 
carrier-frequency shift equal to f,(x(t)). 

A comparison of Eqs. (3) and (7) implies little difference between PM and FM, 
the essential distinction being the integration of the message in FM. Moreover, 
nomenclature notwithstanding, both FM and PM have both time-varying phase and 
frequency, as underscored by Table 5.1—1. These relations clearly indicate that, with 
the help of integrating and differentiating networks, a phase modulator can produce 
frequency modulation and vice versa. In fact, in the case of tone modulation it’s 
nearly impossible visually to distinguish FM and PM waves. 

On the other hand, a comparison of exponential modulation with linear modu- 
lation reveals some pronounced differences. For one thing, 
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Table 5.1-1 Comparison of PM and FM 





Instantaneous phase @(t) Instantaneous frequency f(t) 
1 y 
PM bax(t) fe + a bait) 
TF 
t 
FM 2f. | x(A) da Jfa + faxt) 





Therefore, regardless of the message x(t), the average transmitted power is 


Sr = $A? [8] 


For another, the zero crossings of an exponentially modulated wave are not periodic, 
whereas they are always periodic in linear modulation. Indeed, because of the 
constant-amplitude property of FM and PM, it can be said that 





Finally, since exponential modulation is a nonlinear process, 





Figure 5.1-2 illustrates some of these points by showing typical AM, FM, and 
PM waves. As a mental exercise you may wish to check these waveforms against the 
corresponding modulating signals. For FM and PM this is most easily done by consid- 
ering the instantaneous frequency rather than by substituting x(£) in Eqs. (3) and (7). 

Despite the many similarities of PM and FM, frequency modulation turns out to 
have superior noise-reduction properties and thus will receive most of our attention. 
To gain a qualitative appreciation of FM noise reduction, suppose a demodulator 
simply extracts the instantaneous frequency f(t) = f. + fax(t) from x(t). The 
demodulated output is then proportional to the frequency deviation fa, which can be 
increased without increasing the transmitted power $7. If the noise level remains 
constant, increased signal output is equivalent to reduced noise. However, noise 
reduction does require increased transmission bandwidth to accommodate large fre- 
quency deviations. 

Ironically, frequency modulation was first conceived as a means of bandwidth 
reduction, the argument going somewhat as follows: If, instead of modulating the 
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Figure 5.1-2 Illustrative AM, FM, and PM waveforms. 


carrier amplitude, we modulate the frequency by swinging it over a range of, say, 
+50 Hz, then the transmission bandwidth will be 100 Hz regardless of the message 
bandwidth. As we'll soon see, this argument has a serious flaw, for it ignores the dis- 
tinction between instantaneous and spectral frequency. Carson (1922) recognized 
the fallacy of the bandwidth-reduction notion and cleared the air on that score. 
Unfortunately, he and many others also felt that exponential modulation had no 
advantages over linear modulation with respect to noise. It took some time to over- 
come this belief but, thanks to Armstrong (1936), the merits of exponential modula- 
tion were finally appreciated. Before we can understand them quantitatively, we 
must address the problem of spectral analysis. 





Suppose EM had been defined in direct analogy to AM by writing x.) = 
A, cos w(t) t with w(t) = w,[1 + ux(t)]. Demonstrate the physical impossibility 
of this definition by finding f(t) when x(t) = cos c,, t. 


EXERCISE 5.1-1 






Narrowband PM and FM 


Our spectral analysis of exponential modulation starts with the quadrature-carrier 
version of Eq. (1), namely 


X(t) = x, (f) cos e.t — x (0) sin ot [9] 


where 


xt) = A, cos $(t) = A, -E+ | [19] 
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NET 1 
Xo) = A, sin P(t) = Ay (t) T 31 f(t) ee 
Now we impose the simplifying condition 
\p(t)| < 1 rad [11a] 
so that 
Xet) cá Ac xot) os, A(t) 01 15] 


Then it becomes an easy task to find the spectrum X,(f) of the modulated wave in 
terms of an arbitrary message spectrum X( f). 
Specifically, the transforms of Eqs. (9) and (115) yield 


XQ)-;A8f-1)*2A9(-f) f»0 tmd 
in which 
Canna [&3X0) ™ 
SUDO] ri FM JM 


The FM expression comes from the integration theorem applied to $(ft) in Eq. (6). 

Based on Eq. (12), we conclude that if x(t) has message bandwidth W «& f., 
then x,(t) will be a bandpass signal with bandwidth 2W. But this conclusion holds 
only under the conditions of Eq. (11). For larger values of |ó(?)|, the terms ¢°(0), 
(D, . . . cannot be ignored in Eq. (10) and will increase the bandwidth of x,(t). 
Hence, Eqs. (11) and (12) describe the special case of narrowband phase or fre- 
quency modulation (NBPM or NBFM). 





An informative illustration of Eq. (12) is provided by taking x(t) = sinc 2Wt, so 
X(f) = (1/2W)II(f/2W). The resulting NBPM and NBFM spectra are depicted in 
Fig. 5.1-3. Both spectra have carrier-frequency impulses and bandwidth 2W. How- 
ever, the lower sideband in NBFM is 180? out of phase (represented by the negative 
sign), whereas both NBPM sidebands have a 90? phase shift (represented by j). 
Except for the phase shift, the NBPM spectrum looks just like an AM spectrum with 
the same modulating signal. 


EXAMPLE 5.1-1 





Tone Modulation 


The study of FM and PM with tone modulation can be carried out jointly by the sim- 
ple expedient of allowing a 90° difference in the modulating tones. For if we take 
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Figure 5.1-3 Narrowband modulated spectra with x(f = sinc 2Wt. (a) PM: (b) FM. 


(f) 7: sin w,,¢ PM 
Xx = 
A, COS o, FM 


then Eqs. (2) and (6) both give 


p(t) = B sino, [13a] 
where 
A $4A PM 
2 b] 
p T FM i: 


The parameter f serves as the modulation index for PM or FM with tone modula- 
tion. This parameter equals the maximum phase deviation and is proportional to the 
tone amplitude A,, in both cases. Note, however, that 9 for FM is inversely propor- 
tional to the tone frequency f, since the integration of cos c, yields (sin v, t)/c,,. 
Narrowband tone modulation requires 8 << 1, and Eq. (9) simplifies to 


x,{t) = A, cos o, t — A,B sin Ont sin ot 


A.B 


[14] 
A.B 





= Å, COS Qt — 





cos (W, — «,)t + cos (w, + w,,)t 

The corresponding line spectrum and phasor diagram are shown in Fig. 5.14. 
Observe how the phase reversal of the lower sideband line produces a component 
perpendicular or quadrature to the carrier phasor. This quadrature relationship is 


precisely what's needed to create phase or frequency modulation instead of ampli- 
tude modulation. 
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(a) (b) 


Figure 5.1-4 





NBFM with tone modulation. (a) Line spectrum; (b) phasor diagram. 


Now, to determine the line spectrum with an arbitrary value of the modulation 
index, we drop the narrowband approximation and write 


x(t) = A,[cos P(t) cos v, t — sin (t) sin ot] (15] 


y 


= A [cos (B sin w,,t) cos w,t — sin (B sin w,,t) sin c, t] 


Then we use the fact that, even though x,(1) is not necessarily periodic, the terms 
cos (B sin w,,t) and sin (B sin œmt) are periodic and each can be expanded as a 


trigonometric Fourier series with fy = f,,. Indeed, a well-known result from applied 
mathematics states that 


| 
ul 


EE Rr oor anm eii € x 


cos (B sin e, t) = Jo(B) + > 2 J,(B) cos nont [16] 


neven 


sin(B sin w,,t) = 5 2 Ja (B) sin nont 


n odd 
where n is positive and 


= | APRES aX [17] 


lip 


JB) 


2r 


sg: 


The coefficients J„(8) are Bessel functions of the first kind, of order n and argument 
B. With the aid of Eq. (17), you should encounter little difficulty in deriving the 
trigonometric expansions given in Eq. (16). 


Substituting Eq. (16) into Eq. (15) and expanding products of sines and cosines 


finally yields 
| x(t) = A. Jo(B) cos ot [18a] 
+ WA.J(B)[cos (v; + no,)t — cos (e, — no,)t] 
|- n odd 


+ S A. J,(B)[cos (we + no,)t + cos (o, — NW,)t] 


neven 
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Alternatively, taking advantage of the property that J ,(8) = ( — 1)'J,(B), we get 
the more compact but less informative expression 


x(t) =A, X, J(B) cos (em, + no,)t [18b] 
In either form, Eq. (18) is the mathematical representation for a constant-amplitude 
wave whose instantaneous frequency varies sinusoidally. A phasor interpretation, to 
be given shortly, will shed more light on the matter. 
Examining Eq. (18), we see that 





A typical spectrum is illustrated in Fig. 5.1-5. Note that negative frequency compo- 
nents will be negligible as long as Bf, << f.. In general, the relative amplitude of a 
line at f. + nf, is given by J,(B), so before we can say more about the spectrum, we 
must examine the behavior of Bessel functions. 

Figure 5.1-6a shows a few Bessel functions of various order plotted versus the 
argument B. Several important properties emerge from this plot. 


1. The relative amplitude of the carrier line J9(B) varies with the modulation index 
and hence depends on the modulating signal. Thus, in contrast to linear modu- 
lation, the carrier-frequency component of an FM wave "contains" part of the 
message information. Nonetheless, there will be spectra in which the carrier 
line has zero amplitude since J9(8) = 0 when B = 2.4, 5.5, and so on. 


Jo(B) 








Se fot Sm 


fe + fin 
—J,(B) 


Figure 5.1-5 Line spectrum of FM with tone modulation. 
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The number of sideband lines having appreciable relative amplitude also 
depends on 8. With 8 < 1 only J, and J, are significant, so the spectrum will 
consist of carrier and two sideband lines as in Fig. 5.1—4a. But if 6 > 1, there 
will be many sideband lines, giving a spectrum quite unlike linear modulation. 


Large 6 implies a large bandwidth to accommodate the extensive sideband 
structure, agreeing with the physical interpretation of large frequency deviation. 


Jn(B) 














{b} 


Figure 5.1-6 Plots of Bessel functions. (a) Fixed order n, variable argument B; (b) fixed argu- 


ment B, variable order n. 
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Table 5.1-2 Selected values of J,(8] 
J,(0.1) J,(0.2) J,(0.5) J,(1.0) J,0-0 — J,(5.0) J,(10) 


n n 
0 1.00 0.99 0.94 0.77 0.22 -018 . -025 0 
1 0.05 0.10 0.24 0.44 0.558 —0.33 0.04 1 
2 0.03 0.11 0.35 0.05 0.25 2 
3 0.02 0.13 0.36 0.06 3 
4 l 0.03 039  —0.22 4 
5 026 | —-023 5 
6 013 —0.01 6 
1 0.05 0.22 7 
8 0.02 0.32 8 
9 0.29 9 
10 0.21 10 
ll 012 1 
12 0.06 12 
13 0.03 13 
14 0.01 14 





Some of the above points are better illustrated by Fig. 5.1-6b, which gives J,(B) 
as a function of n/B for various fixed values of D. These curves represent the 
"envelope" of the sideband lines if we multiply the horizontal axis by Bf, to obtain the 
line position nf, relative to f.. Observe in particular that all J,(8) decay monotonically 
for n/B > 1 and that |J,(B)| < 1if|n/B| >> 1. Table 5.1-2 lists selected values 
of J,(B), rounded off at the second decimal place. Blanks in the table correspond to 
\7,(B)| < 0.01. 

Line spectra drawn from the data in Table 5.1-2 are shown in Fig. 5.1-7, omitting 
the sign inversions. Part a of the figure has B increasing with f, held fixed, and applies 
to FM and PM. Part b applies only to FM and illustrates the effect of increasing B by 
decreasing fa with Am fa held fixed. The dashed lines help bring out the concentration 
of significant sideband lines within the range f. + Bfn as B becomes large. 

For the phasor interpretation of x,(1) in Eq. (18), we first return to the narrow- 
band approximation and Fig. 5.1—4. The envelope and phase constructed from the 
carrier and first pair of sideband lines are seen to be 


2 2 2 
A(t) = Ja i3 (28 4.sin ont =~ adi + P" = E oos 20, t 
2(B/2)A, sin «t 
A 





p(t) ^ arctan | | = B sin wnt 


C 


Thus the phase variation is approximately as desired, but there is an additional 
amplitude variation at twice the tone frequency. To cancel out the latter we should 
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Figure 5.1-7 Tone-modulated line spectra. (a) FM or PM with f, fixed; (b) FM with Amh fixed. 


include the second-order pair of sideband lines that rotate at +2f, relative to the car- 
rier and whose resultant is collinear with the carrier. While the second-order pair 
virtually wipes out the undesired amplitude modulation, it also distorts $(f). The 
phase distortion is then corrected by adding the third-order pair, which again intro- 
duces amplitude modulation, and so on ad infinitum. 

When all spectral lines are included, the odd-order pairs have a resultant in 
quadrature with the carrier that provides the desired frequency modulation plus 
unwanted amplitude modulation. The resultant of the even-order pairs, being 
collinear with the carrier, corrects for the amplitude variations. The net effect is then 
as illustrated in Fig. 5.1-8. The tip of the resultant sweeps through a circular arc 
reflecting the constant amplitude A,. 





The narrowband FM signal x,(t) = 100 cos [27 5000t + 0.05 sin 27r 2002] is EXAMPLE 5.1-2 
transmitted. To find the instantaneous frequency f(t) we take the derivative of 0(7) 
fl) = —6() 
2 


= = [2v 5000 + 0.05(27: 200) cos 2 200 1 
Tr 


= 5000 + 10 cos 27 200 t 
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From f(t) we determine that f, = 5000 Hz, f, = 10, and x(t) = cos 27r200t. There 
are two ways to find B. For NBFM with tone modulation we know that 
f(t) = B sin wat. Since x(t) = A,cos(w,t + A(t)], we can see that B = 0.05. 
Alternatively we can calculate 
Am 
p= "EL 

From f(t) we find that A,, f, = 10 and fẹ, = 200 so that 6 = 10/200 = 0.05 just as 
we found earlier. The line spectrum has the form of Fig. 5.1-4a with A, = 100 and 
sidelobes A,8/2 = 2.5. The minor distortion from the narrowband approximation 
shows up in the transmitted power. From the line spectrum we get Sp = 1(—2.5)? + 
5(100)? + 1(2.5)? = 5006.25 versus Sy = $42? = 3(100)? = 5000 when there are 
enough sidelobes so that there is no amplitude distortion. 






EXERCISE 5.1-3 Consider tone-modulated FM with A, = 100, Amfa = 8 KHz, and f, = 4 KHz. 
Draw the line spectrum for f, = 30 kHz and for f, = 11kHz. 





Even-order 


Odd-order sidebands 


sidebands 


Figure 5.1-8 FM phasor diagram for arbitrary B. 


Multitone and Periodic Modulation* 


The Fourier series technique used to arrive at Eq. (18) also can be applied to the case 
of FM with multitone modulation. For instance, suppose that x(t) = A, cos «t + 
A» cos wt, where f, and f, are not harmonically related. The modulated wave is first 
written as 








TUE 
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x(t) = A,[(cos a, cos a — sin o, sin œz) cos wet 
—(sin a, cos o; + cos o, sin o) sin wet] 


where a, = f,sin ot, B, = A, fa/fi and so on. Terms of the form cos a, sin o, 
and so on, are then expanded according to Eq. (16), and after some routine manipu- 
lations we arrive at the compact result 


x(t) - A, €. > J,(Bi)Jn(B2) cos (o, + no, + mw,)t [19] 
This technique can be extended to include three or more nonharmonic tones; the 
procedure is straightforward but tedious. 

To interpret Eq. (19) in the frequency domain, the spectral lines can be divided 
into four categories: (1) the carrier line of amplitude A, J(B,) Jo(B2); (2) sideband 
lines at f. + nf, due to one tone alone; (3) sideband lines at f. + mf, due to the other 
tone alone; and (4) sideband lines at f. + nf, + mf, which appear to be beat- 
frequency modulation at the sum and difference frequencies of the modulating tones 
and their harmonics. (This last category would not occur in linear modulation where 
simple superposition of sideband lines is the rule.) A double-tone FM spectrum 
showing the various types of spectral lines is given in Fig. 5.1-9 for fi << f, and 
B, > B2. Under these conditions there exists the curious property that each sideband 
line at f. + mf, looks like another FM carrier with tone modulation of frequency fi. 

When the tone frequencies are harmonically related—meaning that x(t) is a 
periodic waveform—then (ft) is periodic and so is e/*. The latter can be expanded 
in an exponential Fourier series with coefficients 


1 
Cc, = =| exp j[@(t) — not] dt [20a] 
To R 
Therefore 
x(t) = A Re| Ss cet [20b] 


and A, |c,| equals the magnitude of the spectral line at f = f. + nfo. 
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Figure 5.1-9 Double-tone FM line spectrum with f| < h and 8, > Be. 
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EXAMPLE 5.1-3 EM with Pulse-Train Modulation p 


Let x(t) be a unit-amplitude rectangular pulse train with period 7p, pulse duration 7, 
and duty cycle d = T/T,. After removing the dc component (x(t)) = d, the instan- 
taneous frequency of the resulting FM wave is as shown in Fig. 5.1-10a. The time 
origin is chosen such that $(t) plotted in Fig. 5.1-10b has a peak value $4 = 2mfaT 
at t = 0. We've also taken the constant of integration such that $(t) = 0. Thus 


Olt) = { 


pali + t/r) epe 
pall — t/(T> — 7)] Dur cm 
which defines the range of integration for Eq. (20a). 


The evaluation of c, is a nontrivial exercise involving exponential integrals and 
trigonometric relations. The final result can be written as 





7 [ss "(B -n)d _ (1- d) sin v(8 — n)d le imped 
€. = a(B — n) mB — n)d + mn 
H O (PEE R 


where we've let 


B = falo = falfo 


which plays a role similar to the modulation index for single-tone modulation. 
Figure 5.1-10c plots the magnitude line spectrum for the case of d = 1/4, 
B = 4, and A, = 1. Note the absence of symmetry here and the peaking around 


fO 
fet e — fp 
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Figure 5.1-10 FM with pulse-train modulation. (a) Instantaneous frequency; (b) phase; (c) line spectrum for d = 1/4. 
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fafa- tfa and f =f, + 7fa, the two values taken on by the instantaneous fre- 
quency. The fact that the spectrum contains other frequencies as well underscores 
the difference between spectral frequency and instantaneous frequency. The same 
remarks apply for the continuous spectrum of FM with a single modulating pulse— 
demonstrated by our results in Example 2.5-1. 





5.2 TRANSMISSION BANDWIDTH AND DISTORTION 


The spectrum of a signal with exponential modulation has infinite extent, in general. 
Hence, generation and transmission of pure FM requires infinite bandwidth, 
whether or not the message is bandlimited. But practical FM systems having finite 
bandwidth do exist and perform quite well. Their success depends upon the fact that, 
sufficiently far away from the carrier frequency, the spectral components are quite 
small and may be discarded. True, omitting any portion of the spectrum will cause 
distortion in the demodulated signal; but the distortion can be minimized by keeping 
all significant spectral components. 

We'll formulate in this section estimates of transmission bandwidth require- 
ments by drawing upon results from Sect. 5.1. Then we’ll look at distortion pro- 
duced by linear and nonlinear systems. Topics encountered in passing include the 
concept of wideband FM and that important piece of FM hardware known as a lim- 
iter. We'll concentrate primarily on FM, but minor modifications make the analyses 
applicable to PM. 


Transmission Bandwidth Estimates 


Determination of FM transmission bandwidth boils down to the question: How 
much of the modulated signal spectrum is significant? Of course, significance stan- 
dards are not absolute, being contingent upon the amount of distortion that can be 
tolerated in a specific application. However, rule-of-thumb criteria based on studies 
of tone modulation have met with considerable success and lead to useful approxi- 
mate relations. Our discussion of FM bandwidth requirements therefore begins with 
the significant sideband lines for tone modulation. 

Figure 5.1-6 indicated that J,(8) falls off rapidly for |n/8| > 1, particularly if 
B >> 1. Assuming that the modulation index £ is large, we can say that |J,(8)| is 
significant only for |n| = B = A,,f,/f,, Therefore, all significant lines are con- 
tained in the frequency range f. + Bf, — f. + Amfa, a conclusion agreeing with 
intuitive reasoning. On the other hand, suppose the modulation index is small; then 
all sideband lines are small compared to the carrier, since Jj(B) 2^ J,«0(8) when 
B < 1. But we must retain at least the first-order sideband pair, else there would be 


no frequency modulation at all. Hence, for small f, the significant sideband lines are 
contained in f, + f, 


To put the above observations on a quantitative footing, all sideband lines hav- 
ing relative amplitude |J,(8)| > e are defined as being significant, where e ranges 
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from 0.01 to 0.1 according to the application. Then, if |J,(8)| > € and 
lJu+i(B)| < €, there are M significant sideband pairs and 2M + 1 significant lines 
all told. The bandwidth is thus written as 


B-2M(Bf,  M(B)=1 (1) 


since the lines are spaced by f,, and M depends on the modulation index B. The con- 
dition M(B) = 1 has been included in Eq. (1) to account for the fact that B cannot be 
less than 2f,. 

Figure 5.2-1 shows M as a continuous function of B for e = 0.01 and 0.1. 
Experimental studies indicate that the former is often overly conservative, while the 
latter may result in small but noticeable distortion. Values of M between these two 
bounds are acceptable for most purposes and will be used hereafter. 

But the bandwidth B is not the transmission bandwidth B;; rather it’s the mini- 
mum bandwidth necessary for modulation by a tone of specified amplitude and fre- 
quency. To estimate By, we should calculate the maximum bandwidth required when 
the tone parameters are constrained by A,, = 1 and f, = W. For this purpose, the 
dashed line in Fig. 5.2-1 depicts the approximation 


M(B) ~ B+ 2 [2] 


which falls midway between the solid lines for B = 2. Inserting Eq. (2) into Eq. (1) 
gives 


B~ 2B +2)f, =2 (24 + 2) f = Anfa 2f.) 
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Figure 5.2-1 The number of significant sideband pairs as a function of 8 [or D). 








5.2 Transmission Bandwidth and Distortion 


Now, bearing in mind that f4 is a property of the modulator, what tone produces the 
maximum bandwidth? Clearly, it is the maximum-amplitude-maximum-frequency 
tone having A,, = land f, = W. The worst-case tone-modulation bandwidth is then 


B,e2f.A-2W) ifB>2 


Note carefully that the corresponding modulation index 8 = f,/W is not the maxi- 
mum value of B but rather the value which, combined with the maximum modulat- 
ing frequency, yields the maximum bandwidth. Any other tone having A, < 1 or 
fa < W will require less bandwidth even though 8 may be larger. 

Finally, consider a reasonably smooth but otherwise arbitrary modulating signal 
having the message bandwidth W and satisfying the normalization convention 
|x(t)| = 1. We'll estimate B7 directly from the worst-case tone-modulation analysis, 
assuming that any component in x(t) of smaller amplitude or frequency will require a 
smaller bandwidth than B7. Admittedly, this procedure ignores the fact that superposi- 
tion is not applicable to exponential modulation. However, our investigation of multi- 
tone spectra has shown that the beat-frequency sideband pairs are contained primarily 
within the bandwidth of the dominating tone alone, as illustrated by Fig. 5.1-9. 


Therefore, extrapolating tone modulation to an arbitrary modulating signal, we 
define the deviation ratio 


pata [3] 
W 
which equals the maximum deviation divided by the maximum modulating fre- 
quency, analogous to the modulation index of worst-case tone modulation. The 
transmission bandwidth required for x(t) is then 


B; = 2M(D)W [4] 


where D is treated just like £ to find M(D), say from Fig. 5.2-1. 
Lacking appropriate curves or tables for M(D), there are several approximations 
to B, that can be invoked. With extreme values of the deviation ratio we find that 


aM Dp 
T law D&I 


paralleling our results for tone modulation with 8 very large or very small. Both of 
these approximations are combined in the convenient relation 


D> 1 
= Ufa t = 2(D + 
known as Carson’s rule. Perversely, the majority of actual FM systems have 2 < 
D « 10, for which Carson's rule somewhat underestimates the transmission band- 
width. A better approximation for equipment design is then 


Br~2(f,t+2W) =2D+2)W D>2 [6] 
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which would be used, for example, to determine the 3 dB bandwidths of FM ampli- 
fiers. Note that Carson's rule overestimates By for some applications using the 
narrowband approximation. The bandwidth of the transmitted signal in Example 
5.1-2 is 400 Hz, whereas Eq. (5) estimates B; ~ 420 Hz. 

Physically, the deviation ratio represents the maximum phase deviation of an 
FM wave under worst-case bandwidth conditions. Our FM bandwidth expressions 
therefore apply to phase modulation if we replace D with the maximum phase devi- 
ation $4 of the PM wave. Accordingly, the transmission bandwidth for PM with 
arbitrary x(f) is estimated to be 


Br=2M(b,)W Ma) = 1 [7a] 


or 


Br = 2($4 + 1)W [7b] 


which is the approximation equivalent to Carson’s rule. These expressions differ 
from the FM case in that d, is independent of W. 

You should review our various approximations and their conditions of validity. 
In deference to most of the literature, we'll usually take B; as given by Carson's rule 
in Eqs. (5) and (7b). But when the modulating signal has discontinuities—a rectan- 
gular pulse train, for instance—the bandwidth estimates become invalid and we 
must resort to brute-force spectral analysis. 





XAMPLE 5.2-1 Commercial FM Bandwidth 


Commercial FM broadcast stations in the United States are limited to a maximum 
frequency deviation of 75 kHz, and modulating frequencies typically cover 30 Hz to 
15 kHz. Letting W = 15 kHz, the deviation ratio is D = 75 kHz/15 kHz = 5 and Eq. 
(6) yields Bp = 2(5 + 2) X 15 kHz = 210 kHz. High-quality FM radios have band- 
widths of at least 200 kHz. Carson’s rule in Eq. (5) underestimates the bandwidth, 
giving By ~ 180 kHz. 

If a single modulating tone has A,, = 1 and fa = 15 kHz, then B = 5, M(B) ~ 7, 
and Eq. (1) shows that B = 210 kHz. A lower-frequency tone, say 3 kHz, would 
result in a larger modulation index (B = 25), a greater number of significant sideband 
pairs (M = 27), but a smaller bandwidth since B = 2 X 27 X 3 kHz = 162 kHz. 





EXERCISE 5.2-1 Calculate B/W for D = 0.3, 3, and 30 using Eqs. (5) and (6) where applicable. 





Linear Distortion 


The analysis of distortion produced in an FM or PM wave by a linear network is an 
exceedingly knotty problem— so much so that several different approaches to it 
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have been devised, none of them easy. Panter (1965) devotes three chapters to the 
subject and serves as a reference guide. Since we're limited here to a few pages, we 
can only view the “tip of the iceberg." Nonetheless, we'll gain some valuable 
insights regarding linear distortion of FM and PM. 

Figure 5.2-2 represents an exponentially modulated bandpass signal x,(t) 
applied to a linear system with transfer function H(f), producing the output y,(f). 
The constant-amplitude property of x,(t) allows us to write the lowpass equivalent 
input 


xlt) = 1A 20 8] 


where (t) contains the message information. In terms of X(f), the lowpass 
equivalent output spectrum is 


Yel(f) = Af + SAUS + £X) [9] 
Lowpass-to-bandpass transformation finally gives the output as 
y(t) = 2 Re [yo(r)e/*] [10] 


While this method appears simple on paper, the calculations of Xe (f) = #[x¢,(t) ] 


and ye (t) = ¥*[Y,,(f)] generally prove to be major stumbling blocks. Computer- 


aided numerical techniques are then necessary. 

One of the few cases for which Eqs. (8)-(10) yield closed-form results is the 
transfer function plotted in Fig. 5.2-3. The gain |H(f)| equals Ko at f, and increases 
(or decreases) linearly with slope K,/f,; the phase-shift curve corresponds to carrier 


T i En 


Hf) 


Figure 5.2-2 
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delay ty and group delay t,, as discussed in Example 4.1-1. The lowpass equivalent 
of H(f) is 


Aft+fju(ft+f) = (x. + mre meten 


and Eq. (9) becomes 


. K l : 
Lah Roe ODE) e T sp Pu “Hel Far f )X S (f£ Je PF] 


[^ 


Invoking the time-delay and differentiation theorems for 9» '[Y,( f) ] we see that 


f K ; 
TO rne — 1) +The Me det — n) 


c 


where 


" dii ; A 
Zelt — t4) = tlla, e| = Adli = ty Jeje 
obtained from Eq. (8). 


Inserting these expressions into Eq. (10) gives the output signal 


y(t) = A(t) cos [wht — to) + lt — t)] [al 
which has a time-varying amplitude 


A(t) = A Ke + P = «)| [11b] 


In the case of an FM input, $(t) = 27, x(t) so 


A(t) = Ad Ke + Eula t - 4) [12] 
c 

Equation (12) has the same form as the envelope of an AM wave with u = 
K, fxlKof,. We thus conclude that |H()| in Fig. 5.2-3 produces FM-to-AM conver- 
sion, along with the carrier delay ty and group delay t, produced by arg H(f). (By. 
the way, a second look at Example 4.2-2 reveals that amplitude distortion of an AM 
wave can produce AM-to-PM conversion.) 

FM-to-AM conversion does not present an insurmountable problem for FM or 
PM transmission, as long as $(t) suffers no ill effects other than time delay. We 
therefore ignore the amplitude distortion from any reasonably smooth gain curve. 
But delay distortion from a nonlinear phase-shift curve can be quite severe and must 
be equalized in order to preserve the message information. 

A simplified approach to phase-distortion effects is provided by the quasi-static 
approximation which assumes that the instantaneous frequency of an FM wave with 
fa 7 W varies so slowly compared to 1/W that x(t) looks more or less like an 
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ordinary sinusoid at frequency f(t) = f. + fa x(t). For if the system's response to a 
carrier-frequency sinusoid is 


yet) = A,B()| cos [wt + arg HC)] 


and if x(t) has a slowly changing instantaneous frequency f(t), then 











y(t) = A-|A[f(t)]| cos {mt + P(t) + arg AL F(t) ]} [13] 
It can be shown that this approximation requires the condition 
E 1 dH(f) 
$ (tux oS << 8m? [14] 
| ( | a H(f) df? at 


in which | (t)| = 4« ?fAW for tone-modulated FM with f, = W. If H(f) represents 
a single-tuned bandpass filter with 3 dB bandwidth B, then the second term in Eq. 
(14) equals 8/B? and the condition becomes 4/AW/B? << 1 which is satisfied by the 
transmission bandwidth requirement B = Bz. 

Now suppose that Eq. (14) holds and the system has a nonlinear phase shift 


such as arg H(f) = af”, where a is a constant. Upon substituting f(f) = f. + $T 
we get 


arg HU ()] = af? + SI Gi) + 80 


Thus, the total phase in Eq. (13) will be distorted by the addition of (r) and $?(r). 





EXERCISE 5.2-2 


Let |H(f)| = 1 and arg A(f) = —2art(f — f.). Show that Eqs. (11) and (13) give 
the same result with $(t) = B sin c, t provided that wp t, << T. 





Nonlinear Distortion and Limiters 


Amplitude distortion of an FM wave produces FM-to-AM conversion. Here we'll 
show that the resulting AM can be eliminated through the use of controlled nonlin- 
ear distortion and filtering. 


For purposes of analysis, let the input signal in Fig. 5.2-4 be 
Vint) = A(t) cos 0 (t) 
where 0t) = ct + $(t) and A(f) is the amplitude. The nonlinear element is 
assumed to be memoryless—meaning no energy storage—so the input and output 


are related by an instantaneous nonlinear transfer characteristic Vou = T[v;,]. We'll 
also assume for convenience that T[0] = 0. 


Although v,,(f) is not necessarily periodic in time, it may be viewed as a peri- 
odic function of 0, with period 277. (Try to visualize plotting Vin versus 0, with time 
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Nonlinear element 


< > Volt) = T Vial] 


Vint) 





Figure 5.2-4 


held fixed.) Likewise, the output is a periodic function of 0, and can be expanded in 
the trigonometric Fourier series 


ceo 
Us = >, [2a,| cos (n 0, + arg an) [15a] 
n-i 
where 
1 -- jn8, 
a, — ——| T[vyle 7" q6, [15b] 
2T Be 


The time variable t does not appear explicitly here, but Vou depends on t via the 
time-variation of 0.. Additionally, the coefficients a, may be functions of time when 
the amplitude of vin has time variations. 


But we'll first consider the case of an undistorted FM input, so A(f) equals the 


constant A, and all the a, are constants. Hence, writing out Eq. (15a) term by term 
with ¢ explicitly included, we have 


Vout) = |2a4| cos [ot + A(t) + arg ai] [16] 
+ |2a,| cos [2w,t + 26(t) + arg a;] 
+ em 


i 


i eno Um add LHS 


This expression reveals that the nonlinear distortion produces additional FM waves 
at harmonics of the carrier frequency, the nth harmonic having constant amplitude 
[2a,] and phase modulation n@(t) plus a constant phase shift arg a,. 

If these waves don't overlap in the frequency domain, the undistorted input can 
be recovered by applying the distorted output to a bandpass filter. Thus, we say that 
FM enjoys considerable immunity from the effects of memoryless nonlinear 
distortion. 

Now let's return to FM with unwanted amplitude variations A(t). Those varia- 
tions can be flattened out by an ideal hard limiter or clipper whose transfer char- 
acteristic is plotted in Fig. 5.2-5a. Figure 5.2—5b shows a clipper circuit employing 
back-to-back Zener diodes with breakdown voltage V, at the output of a high-gain 
amplifier. 


The clipper output looks essentially like a square wave, since T[v,,] = 
V, Sgn Vin and 
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Vout 
+ Vo 

Oo—| 
+ + 

0 Vin 
Vin Vout 

-V9 E E 
[a] (b) 
Figure 5.2-5 Hard limiter. (a) Transfer characteristic; (b] circuit realization with Zener diodes. 


The coefficients are then found from Eq. (15b) to be 


AVo/arn n= 1,5,9,... 
a, = 4 —AVy/an n = 3,7,11,... 
0 n= 2,4,6, ... 


which are independent of time because A(t) = 0 does not affect the sign of Vin 
Therefore, 


Vout) = Lae + (t)] — Su cos [3o,t + 39(r)) + = [17] 
T 3 

and bandpass filtering yields a constant-amplitude FM wave if the components of 

Ug (£) have no spectral overlap. Incidentally, this analysis lends support to the previ- 

ous statement that message information resides entirely in the zero-crossings of an 

FM or PM wave. 

Figure 5.2-6 summarizes our results. The limiter plus BPF in part a removes 
unwanted amplitude variations from an AM or PM wave, and would be used in a 
receiver. The nonlinear element in part b distorts a constant-amplitude wave, but the 
BPF passes only the undistorted term at the nth harmonic. This combination acts as 
a frequency multiplier if n > 1, and is used in certain types of transmitters. 


A(t) cos [wt + H(A] —<-» n cos [wst + dt) 


la) 
A. cos [wt + b(t] —< A [2a,| cos [nwt + no(t) + arg a,] 
(b) 


Figure 5.2-6 Nonlinear processing circuits. [a] Amplitude limiter; (b) frequency multiplier. 
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5.3 GENERATION AND DETECTION OF FM AND PM 


The operating principles of several methods for the generation and detection of 
exponential modulation are presented in this section. Other FM and PM systems that 
involve phase-lock loops will be mentioned in Sect. 7.3. Additional methods and 
information regarding specific circuit designs can be found in the radio electronics 
texts cited at the back of the book. 

When considering equipment for exponential modulation, you should keep in 
mind that the instantaneous phase or frequency varies linearly with the message 
waveform. Devices are thus required that produce or are sensitive to phase or fre- 
quency variation in a linear fashion. Such characteristics can be approximated in a 
variety of ways, but it is sometimes difficult to obtain a suitably linear relationship 
over a wide operating range. 

On the other hand, the constant-amplitude property of exponential modulation 
is a definite advantage from the hardware viewpoint. For one thing, the designer 
need not worry about excessive power dissipation or high-voltage breakdown due to 
extreme envelope peaks. For another, the relative immunity to nonlinear distortion 
allows the use of nonlinear electronic devices that would hopelessly distort a signal 
with linear modulation. Consequently, considerable latitude is possible in the design 
and selection of equipment. As a case in point, the microwave repeater links of long- 
distance telephone communications employ FM primarily because the wideband 


linear amplifiers required for amplitude modulation are unavailable at microwave 
frequencies. 


Direct FM and VCOs 


Conceptually, direct FM is straightforward and requires nothing more than a voltage- 
controlled oscillator (VCO) whose oscillation frequency has a linear dependence on 
applied voltage. It's possible to modulate a conventional tuned-circuit oscillator by 
introducing a variable-reactance element as part of the LC parallel resonant circuit. 
If the equivalent capacitance has a time dependence of the form 


C(t) = Co — Cx(t) 


and if Cx(f) is "small enough” and “slow enough,” then the oscillator produces 
x(t) = A, cos 6,(t) where 





84) = 4-1 - Ex)" 
At) = = 1-—~xt 
VLC VLC, Co 
Letting w, = 1/ V LC and assuming |(C/Cp)x(t)| << 1, the binomial series expan- 
sion gives 6,(t) = [1 + (C/2Co)x(t)], or 


rt 


C 
Ce ie EA f| x(A)dÀ n] 
326, 
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Tuned 
N:1 RFC DC block circuit 
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JE + 
x(t) 
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Figure 5.3-1 VCO circuit with varactor diode for variable reactance. 


which constitutes frequency modulation with fa = (C/2C))f.. Since |x(t)| = 1, the 
approximation is good to within 1 percent when C/C, < 0.013 so the attainable fre- 
quency deviation is limited by 


C 
a= 26, = 0.006 f. [2] 


This limitation quantifies our meaning of Cx(t) being "small" and seldom imposes 
a design hardship. Similarly, the usual condition W «& f, ensures that Cx(t) is “slow 
enough." 

Figure 5.3-1 shows a tuned-circuit oscillator with a varactor diode biased to get 
Cx(t). The input transformer, RF choke (RFC), and dc block serve to isolate the low- 
frequency, high-frequency, and dc voltages. The major disadvantage with this type 
of circuit is that the carrier frequency tends to drift and must be stabilized by rather 
elaborate feedback frequency control. For this reason, many older FM transmitters 
are of the indirect type. 

Linear integrated-circuit (IC) voltage-controlled. oscillators can generate a 
direct FM output waveform that 1s relatively stable and accurate. However, in order 
to operate, IC VCOs require several additional external components to function. 
Because of their low output power, they are most suitable for applications such as 
cordless telephones. Figure 5.3—2 shows the schematic diagram for a direct FM 
transmitter using the Motorola MC1376, an 8-pin IC FM modulator. Tbe MC1376 
operates with carrier frequencies between 1.4 and 14 MHz. The VCO is fairly linear 
between 2 and 4 volts and can produce a peak frequency deviation of approximately 
150 kHz. Higher power outputs can be achieved by utilizing an auxiliary transistor 
connected to a 12-V power supply. 


Phase Modulators and Indirect FM 


Although we seldom transmit a PM wave, we're still interested in phase modulators 
because: (1) the implementation is relatively easy; (2) the carrier can be supplied by 
a stable frequency source, such as a crystal-controlled oscillator; and (3) integrating 
the input signal to a phase modulator produces a frequency-modulated output. 
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Figure 5.3-2 Schematic diagram of IC VCO direct FM generator utilizing the Motorola 
MC1376. 
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Figure 5.3-3 Narrowband phase modulator. 


Figure 5.3-3 depicts a narrowband phase modulator derived from the approxi- 
mation x,(t) = A,cos wt — A.Q ax(t) sin o, t—see Eqs. (9) and (11), Sect. 5.1. 
The evident simplicity of this modulator depends upon the approximation condi- 
tion |óAx(£)| <1 radian, and phase deviations greater than 10° result in distorted 
modulation. 

Larger phase shifts can be achieved by the switching-circuit modulator in 
Fig. 5.3-4. The typical waveforms shown in Fig. 5.3-4 help explain the operation. 
The modulating signal and a sawtooth wave at twice the carrier frequency are 
applied to a comparator. The comparator’s output voltage goes high whenever x(t) 
exceeds the sawtooth wave, and the flip-flop switches states at each rising edge of a 
comparator pulse. The flip-flop thus produces a phase-modulated square wave (like 
the output of a hard limiter), and bandpass filtering yields x,(2). 

Now consider the indirect FM transmitter diagrammed in Fig. 5.3—5. The inte- 
grator and phase modulator constitute a narrowband frequency modulator that gen- 
erates an initial NBFM signal with instantaneous frequency 


fl) =f, Rs x) 
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Figure 5.3-4 Switching-circuit phase modulator. (a) Schematic diagram (b] waveforms. 
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Figure 5.3-5 Indirect FM transmitter. 


where T is the integrator's proportionality constant. The initial frequency deviation 
therefore equals $4/27T and must be increased to the desired value fa by a fre- 
quency multiplier. 

The frequency multiplier produces n-fold multiplication of instantaneous fre- 
quency, so 


At) = nfi(t) = nf, + fax(t) [3] 


_ Pa 
fa = (25) 


where 
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Typical frequency multipliers consist of a chain of doublers and triplers, each unit 
constructed as shown in Fig. 5.2-6b. Note that this multiplication is a subtle process, 
affecting the range of frequency variation but not the rate. Frequency multiplication 
of a tone-modulated signal, for instance, increases the carrier frequency and modu- 
lation index but not the modulation frequency, so the amplitude of the sideband 
lines is altered while the line spacing remains the same. (Compare the spectra in 
Fig. 5.1-7a with B = 5 and B = 10.) 

The amount of multiplication required to get fa usually results in nf,, being much 
higher than the desired carrier frequency. Hence, Fig. 5.3—5 includes a frequency con- 
verter that translates the spectrum intact down to f. = Inf. + fio| and the final 
instantaneous frequency becomes f(t) = f. + fax(t). (The frequency conversion 
may actually be performed in the middle of the multiplier chain to keep the frequen- 
cies at reasonable values.) The last system component is a power amplifier, since all 
of the previous operations must be carried out at low power levels. Note the similar- 
ity to the ring modulator discussed in Sect. 4.3 that is used to generate DSB signals. 





EXAMPLE 5.3-1 The indirect FM system originally designed by Armstrong employed a narrowband 
phase modulator in the form of Fig. 5.3—3 and produced a minute initial frequency devi- 
ation. As an illustration with representative numbers, suppose that @,/27T = 15 Hz 
(which ensures negligible modulation distortion) and that f. = 200 kHz (which falls 
near the lower limit of practical crystal-oscillator circuits). A broadcast FM output 
with fA = 75 kHz requires frequency multiplication by the factor n = 75,000 + 15 = 
5000. This could be achieved with a chain of four triplers and six doublers, so 
n = 3^ x 2° = 5184. But nf. ~ 5000 x 200 kHz = 1000 MHz, and a down- 
converter with f; 9 ~ 900 MHz is needed to put f, in the FM band of 88-108 MHz. 





EXERCISE 5.3-1 Show that the phase at the output of Fig. 5.3-3 is given by 
P(t) = éax(t) — 3h ax%(t) + $6) + c [4] 


Hence, (ft) contains odd-harmonic distortion unless $4 is quite small. 





Triangular-Wave FM* 


Triangular-wave FM is a modern and rather novel method for frequency modulation 
that overcomes the inherent problems of conventional VCOs and indirect FM sys- 
tems. The method generates virtually distortionless modulation at carrier frequen- 
cies up to 30 MHz, and is particularly well suited for instrumentation applications. 
We'll define triangular FM by working backwards from x,(f) = A, cos 6,(f) with 


A(t) = ext + (t) — (0) 
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where the initial phase shift — (0) has been included so that 0,(0) = 0. This phase 
shift does not affect the instantaneous frequency 


s 
f(t) = Dine 6 (t) "Xs + fax(t) 
Expressed in terms of 0,(1), a unit-amplitude triangular FM signal is 
2 
x(t) = = arcsin [cos 6,(t) | [5a] 


which defines a triangular waveform when @(f) = 0. Even with $(t) # 0, Eq. (5a) 
represents a periodic triangular function of 0,, as plotted in Fig. 5.3-6a. Thus, 


2 
I= D Roe T 
T 
XA CC 2 [5b] 
-3 +76. m eg. m 


and so forth for 6, > 2r. 
Figure 5.3-6b shows the block diagram of a system that produces x,(t) from 
the voltage 


v(t) = 6) = AU. + fo) 





Electronic switch 


v(t) = 26.0) as! J x0) 


= AU + fax] 
1 Schmitt trigger 


(b) 


Figure 5.3-6 Triangular-wave FM. (a) Waveform; (b) modulation system. 
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which is readily derived from the message waveform x(t). The system consists of an 
analog inverter, an integrator, and a Schmitt trigger controlling an electronic switch. 
The trigger puts the switch in the upper position whenever x,(t) increases to +1 and 
puts the switch in the lower position whenever x,(¢) decreases to —1. 


Suppose the system starts operating at t = O with x,(0) = +1 and the switch in 
the upper position. Then, for 0 < t < t, 


rat) -1- [va = 1 - 2 [0,0 = 0] 
0 


2 
Les). quse 


so x,(t) traces out the downward ramp in Fig. 5.3-6a until time t, when 


xati) = —1, corresponding to 6,(t,) = sr. Now the trigger throws the switch to the 
lower position and 


xa(t) = —1 + | uijdos cq = iodo) - 6.(t;)] 


f 


2 
—3 + = 0t) f «t«t 


so x4(t) traces out the upward ramp in Fig. 5.3-6a. The upward ramp continues until 
time t, when 0,(t;) = 27 and x4(t;) = +1. The switch then triggers back to the 
upper position, and the operating cycle goes on periodically for t > t. 

A sinusoidal FM wave is obtained from x,(t) using a nonlinear waveshaper 
with transfer characteristics T[x,(t)] = A, sin [(7/2)x4(17)], which performs the 
inverse of Eq. (5a). Or x,4(t) can be applied to a hard limiter to produce square- 
wave FM. A laboratory test generator might have all three outputs available. 


Frequency Detection 


A frequency detector, often called a discriminator, produces an output voltage 
that should vary linearly with the instantaneous frequency of the input. There are 
perhaps as many different circuit designs for frequency detection as there are 
designers who have considered the problem. However, almost every circuit falls into 
one of the following four operational categories: 

1. FM-to-AM conversion 

2. Phase-shift discrimination 

3. Zero-crossing detection 

4. Frequency feedback 


We'll lock at illustrative examples from the first three categories, postponing fre- 
quency feedback to Sect. 7.3. Analog phase detection 1s not discussed here because 
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Figure 5.3-7 (a) Frequency detector with limiter and FM-to-AM conversion; (b) waveforms. 


it’s seldom needed in practice and, if needed, can be accomplished by integrating 
the output of a frequency detector. 

Any device or circuit whose output equals the time derivative of the input pro- 
duces FM-to-AM conversion. To be more specific, let x(f) = A,cos 0.(t) with 
Òt) = 2 f, + fax(t)]; then 


x(t) = —A6,{t) sin (t) [6] 
= 2q Af. + fx x(t)] sin [G(r) + 180°] 


Hence, an envelope detector with input x,(t) yields an output proportional to 
F(t) = f. + fax). 

Figure 5.3-7a diagrams a conceptual frequency detector based on Eq. (6). The 
diagram includes a limiter at the input to remove any spurious amplitude variations 
from x(t) before they reach the envelope detector. It also includes a dc block to 
remove the constant carrier-frequency offset from the output signal. Typical wave- 
forms are sketched in Fig. 5.3-7b taking the case of tone modulation. 
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For actual hardware implementation of FM-to-AM conversion, we draw upon 
the fact that an ideal differentiator has |H( f) | = 2f. Slightly above or below reso- 
nance, the transfer function of an ordinary tuned circuit shown in Fig. 5.3-8a approx- 
imates the desired linear amplitude response over a small frequency range. Thus, for 
instance, a detuned AM receiver will roughly demodulate FM via slope detection. 

Extended linearity is achieved by the balanced discriminator circuit in 
Fig. 5.3-8b5. A balanced discriminator includes two resonant circuits, one tuned 
above f, and the other below, and the output equals the difference of the two 
envelopes. The resulting frequency-to-voltage characteristic takes the form of the 
well-known S curve in Fig. 5.3-8c. No dc block is needed, since the carrier-frequency 
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Figure 5.3-8 (a) Slope detection with a tuned circuit; {b} balanced discriminator circuit; 


(c] frequency-to-voltage choracteristic. 
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offset cancels out, and the circuit has good performance at low modulating frequen- 
cies. The balanced configuration easily adapts to the microwave band, with resonant 
cavities serving as tuned circuits and crystal diodes for envelope detectors. 

Phase-shift discriminators involve circuits with linear phase response, in con- 
trast to the linear amplitude response of slope detection. The underlying principle 
comes from an approximation for time differentiation, namely 


a(t) = = [v(¢) — v(t — 1)] m 


] 
ti 


providing that 7; is small compared to the variation of v(t). Now an FM wave has 
P(t) = 2«rfax(1) so 

P(t) — H(t — th) =~ h(t) = 2m fati (t) [8] 
The term $(t — 1;) can be obtained with the help of a delay line or, equivalently, a 
linear phase-shift network. 

Figure 5.3—9 represents a phase-shift discriminator built with a network having 
group delay 7, and carrier delay tọ such that «t, = 90?— which accounts for the 
name quadrature detector. From Eq. (11), Sect. 5.2, the phase-shifted signal is pro- 
portional to cos[w,t — 90? + (t — t,)] = sin [ot + (t — t;)]. Multiplication 
by cos [w,t + (t)] followed by lowpass filtering yields an output proportional to 


sin|é(t) — P(t — t)] = H(t) — ét — t) 
assuming t, is small enough that |@(t) — (t — t,)| << m. Therefore, 
yolt) ~ Kpfax(t) 


where the detection constant Kp includes t,. Despite these approximations, a quad- 
rature detector provides better linearity than a balanced discriminator and is often 
found in high-quality receivers. 

Other phase-shift circuit realizations include the Foster-Seely discriminator 
and the popular ratio detector. The latter is particularly ingenious and economical, 
for it combines the operations of limiting and demodulation into one unit. See 
Tomasi (1998, Chap. 7) for further details. 


cos [w,t + P(t)] 
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Figure 5.3-9 Phase-shift discriminator or quadrature detector. 
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Figure 5.3-10 Zero-crossing detector. {a} Diagram; (b) waveforms. 


Lastly, Fig. 5.3-10 gives the diagram and waveforms for a simplified zero- 
crossing detector. The square-wave FM signal from a hard limiter triggers a mono- 
stable pulse generator, which produces a short pulse of fixed amplitude A and duration 
7 at each upward (or downward) zero crossing of the FM wave. If we invoke the quasi- 
static viewpoint and consider a time interval T such that W «€ 1/T < f. the mono- 
stable output v(t) looks like a rectangular pulse train with nearly constant period 1/f (t). 
Thus, there are ny = Tf(t) pulses in this interval, and continually integrating v(t) over 
the past T seconds yields 

1 t 
r| 


t- 


v(A) dA = L mAr = ATf(t) 
T 


which becomes yp(t) = Kpfax(t) after the dc block. 

Commercial zero-crossing detectors may have better than 0.1 percent linearity 
and operate at center frequencies from 1 Hz to 10 MHz. A divide-by-ten counter 
inserted after the hard limiter extends the range up to 100 MHz. 

Today most FM. communication devices utilize linear integrated circuits for FM 
detection. Their reliability, small size, and ease of design have fueled the growth of 
portable two-way FM and cellular radio communications systems. Phase-lock loops 
and FM detection will be discussed in Sect. 7.3. 





Given a delay line with time delay t; << 1/f,, devise a frequency detector based on 
Eqs. (6) and (7). 


EXERCISE 5.3-2 
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5.4 INTERFERENCE 


Interference refers to the contamination of an information-bearing signal by another 
similar signal, usually from a human source. This occurs in radio communication 
when the receiving antenna picks up two or more signals in the same frequency 
band. Interference may also result from multipath propagation, or from electromag- 
netic coupling between transmission cables. Regardless of the cause, severe inter- 
ference prevents successful recovery of the message information. 

Our study of interference begins with the simple but nonetheless informative 
case of interfering sinusoids, representing unmodulated carrier waves. This simpli- 
fied case helps bring out the differences between interference effects in AM, FM, 
and PM. Then we'll see how the technique of deemphasis filtering improves FM 
performance in the face of interference. We conclude with a brief examination of the 
FM capture effect. 


Interfering Sinusoids 
Consider a receiver tuned to some carrier frequency f.. Let the total received signal be 
u(t) = Acos o, t + A;cos [(w, + w)t + $1] 


The first term represents the desired signal as an unmodulated carrier, while the sec- 
ond term is an interfering carrier with amplitude A;, frequency f. + f, and relative 
phase angle ¢;. 
To put u(t) in the envelope-and-phase form u(t) = A,(t) cos [ot + ¢,(t)], 
we'll introduce 
p = Ai/ A, 0t) 2 wt + Q; ur 
Hence, A; = pA, and the phasor construction in Fig. 5.4-1 gives 


A,(t) ^ A,V 1 + p? + 2p cos 0t) [2] 


p sin Ot) 


= t ss mm 
PaA atan 1 + pcos Ot) 


A, p sin 00) 





Ac [1 + p cos 6,(t)] 


Figure 5.4-1 Phasor diagram of interfering carriers. 
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These expressions show that interfering sinusoids produce both amplitude and 
phase modulation. In fact, if p << 1 then 


A(t) ^ A,{1 + pcos (w;t + $)] [3] 
Plt) = p sin (w;t + 6) 


which looks like tone modulation at frequency f; with AM modulation index u = p 
and FM or PM modulation index 8 = p. At the other extreme, if p >> 1 then 


A(t) = A[1 + p^'cos (wt + 61] 
$ (t) Pe w;t T $i 


so the envelope still has tone modulation but the phase corresponds to a shifted car- 
rier frequency f, + f; plus the constant @;. 

Next we investigate what happens when v(t) is applied to an ideal envelope, 
phase, or frequency demodulator with detection constant Kp. We'll take the weak 
interference case (p << 1) and use the approximation in Eq. (3) with ¢; = 0. Thus, 
the demodulated output is 


Kp(1 + p cos ct) AM 
yp(t) ~ 4 Kppsin et PM [4] 
Kppficos ct FM 


provided that |f£| = W— otherwise, the lowpass filter at the output of the demodula- 
tor would reject |f| > W. The constant term in the AM result would be removed if 
the demodulator includes a dc block. As written, this result also holds for synchro- 
nous detection in DSB and SSB systems since we've assumed d; = 0. The multi- 
plicative factor f; in the FM result comes from the instantaneous frequency deviation 
P (0/27. 

Equation (4) reveals that weak interference in a linear modulation system or 
phase modulation system produces a spurious output tone with amplitude propor- 
tional to p = A,/A,, independent of f. But the tone amplitude is proportional to pf; 
in an FM system. Consequently, FM will be less vulnerable to interference from a 
cochannel signal having the same carrier frequency, so f; ~ 0, and more vulnerable 
to adjacent-channel interference (f; # 0). Figure 5.4-2 illustrates this difference in 


3 AM and 
2] PM 

[2^ 

E 

< 

i | fi! 
0 Ww 
Figure 5.4-2 Amplitude of demodulated interference from a carrier at frequency f. + f. 
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the form of a plot of demodulated interference amplitude versus |f;|. (The crossover 
point would correspond to |f| = 1 Hz if all three detector constants had the same 
numerical value.) 

The analysis of demodulated interference becomes a much more difficult task 
with arbitrary values of p and/or modulated carriers. We'll return to that problem 
after exploring the implications of Fig. 5.4-2. 





Let A; = A, so p = 1 in Eq. (2). Take $; = 0 and use trigonometric identities to show 
that 


EXERCISE 5.4-1 


A,(t) T 2A, |cos (w; t/2)| P(t) = 0j t/2 


Then sketch the demodulated output waveform for envelope, phase, and frequency 
detection assuming f, << W. 





Deemphasis and Preemphasis Filtering 


The fact that detected FM interference is most severe at large values of |f; | suggests 
a method for improving system performance with selective postdetection filtering, 
called deemphasis filtering. Suppose the demodulator is followed by a lowpass fil- 
ter having an amplitude ratio that begins to decrease gradually below W; this will 
deemphasize the high-frequency portion of the message band and thereby reduce the 
more serious interference. À sharp-cutoff (ideal) lowpass filter is still required to 
remove any residual components above W, so the complete demodulator consists of 
a frequency detector, deemphasis filter, and lowpass filter, as in Fig. 5.4-3. 

Obviously deemphasis filtering also attenuates the high-frequency components of 
the message itself, causing distortion of the output signal unless corrective measures 
are taken. But it's a simple matter to compensate for deemphasis distortion by predis- 
torting or preemphasizing the modulating signal at the transmitter before modula- 
tion. The preemphasis and deemphasis filter characteristics should be related by 


TE M 
Half) 


to yield net undistorted transmission. In essence, 


Hy) f| sw [5] 





Frequency Deemphasis LPF 
det filter | 


Figure 5.4-3 Complete FM demodulator. 
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Preemphasis/deemphasis filtering offers potential advantages whenever unde- 
sired contaminations tend to predominate at certain portions of the message band. 
For instance, the Dolby system for tape recording dynamically adjusts the amount of 
preemphasis/deemphasis in inverse proportion to the high-frequency signal content; 
see Stremler (1990, App. F) for details. However, little is gained from deemphasiz- 
ing phase modulation or linear modulation because the demodulated interference 
amplitude does not depend on the frequency. 

The FM deemphasis filter is usually a simple first-order network having 


f -} 1 lf | < Bac 
wn-[AP fe en 
J 


where the 3 dB bandwidth B,, is considerably less than the message bandwidth 
W. Since the interference amplitude increases linearly with |f; | in the absence of fil- 
tering, the deemphasized interference response is |H4( f)| X |f|, as sketched in 
Fig. 5.4-4. Note that, like PM, this becomes constant for || >> Bae- Therefore, 
FM can be superior to PM for both adjacent-channel and cochannel interference. 
At the transmitting end, the corresponding preemphasis filter function should be 


Hy(f) = £ (É) = 2 IF| > Ba [7] 


de 











which has little effect on the lower message frequencies. At higher frequencies, 
however, the filter acts as a differentiator, the output spectrum being proportional to 
fXCf) for | f | >> B,,. But differentiating a signal before frequency modulation is 
equivalent to phase modulation! Hence, preemphasized FM is actually a combina- 
tion of FM and PM, combining the advantages of both with respect to interference. 





FM 
" X 
"t" ^ 
E / PM 
cape disse 
< E: 

i FM with deemphasis 
| fi! 
0 Be K 


Figure 5.4-4 Demodulated interference amplitude with FM deemphasis filtering. 
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As might be expected, this turns out to be equally effective for reducing noise, as 
will be discussed in more detail in Chap. 10. 

Referring to H e(f) as given above, we see that the amplitude of the maximum 
modulating frequency is increased by a factor of W/B,., which means that the fre- 
quency deviation is increased by this same factor. Generally speaking, the increased 
deviation requires a greater transmission bandwidth, so the preemphasis-deemphasis 
improvement is not without price. Fortunately, many modulating signals of interest, 
particularly audio signals, have relatively little energy in the high-frequency end of 
the message band, and therefore the higher frequency components do not develop 
maximum deviation, the transmission bandwidth being dictated by lower compo- 
nents of larger amplitude. Adding high-frequency preemphasis tends to equalize the 
message spectrum so that all components require the same bandwidth. Under this 
condition, the transmission bandwidth need not be increased. 





Typical deemphasis and preemphasis networks for commercial FM are shown in EXAMPLE 5.4-1 
Fig. 5.4—5 along with their Bode diagrams. The RC time constant in both circuits 
equals 75 us, so Ba, = l1/2vRC = 2.1 kHz. The preemphasis filter has an upper 
break frequency at f, = (R + r)/27RrC, usually chosen to be well above the audio 


range, say f, = 30 kHz. 





Suppose an audio signal is modeled as a sum of tones with low-frequency amplitudes 
Am = 1 forfa = 1 kHz and high-frequency amplitudes Am = 1 kHz/f, for fn > 1 kHz. 
Use Eqs. (1) and (2), Sect. 5.2, to estimate the bandwidth required for a single tone at 
fm = 15 kHz whose amplitude has been preemphasized by |H,-(f)| given in Eq. (7) 
with B4, = 2 kHz. Assume f, = 75 kHz and compare your result with By = 210 kHz. 


EXERCISE 5.4-2 








[Hel dB 
0 





(b) 


Figure 5.4-5 (a) Deemphasis filter; (b) preemphasis filter. 
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FM Capture Effect% 


Capture effect is a phenomenon that takes place in FM systems when two signals 
have nearly equal amplitudes at the receiver. Small variations of relative amplitude 
then cause the stronger of the two to dominate the situation, suddenly displacing the 
other signal at the demodulated output. You may have heard the annoying results 
when listening to a distant FM station. 

For a reasonably tractable analysis of capture effect, we'll consider an unmodu- 
lated carrier with modulated cochannel interference (f, = 0). The resultant phase 
Q (t) is then given by Eq. (2) with 6,{t) = p(t), where ¢,(t) denotes the phase mod- 
ulation of the interfering signal. Thus, if Kp — 1 for simplicity, the demodulated 
signal becomes 





ain Ilao P PAO 
yp(t) = ¢,(t) = dt de aE p cos "o x 
= a(p, $j) $ (t) 
where 
ubi 2 pcd. i 


1+ p? + 2p cos o, 


The presence of $ (f) in Eq. (8a) indicates potentially intelligible interference (or 
cross talk) to the extent that a(p,@;) remains constant with time. After all, if 
p >> lthen a(p, $;) ~ land y,(t) = $ (t). 

But capture effect occurs when A; = A,, so p = 1 and Eq. (8b) does not imme- 
diately simplify. Instead, we note that 


p/(1 + p) $0, 2m,... 
alp, pi) = $ p*/(1 + p?) $,— + m[2, + 3m[2,... 
-p/l-p) $= Em, + 3mo 


and we resort to plots of a(p, ;) versus $; as shown in Fig. 5.4—6a. Except for the 
negative spikes, these plots approach a(p, $;) = 0.5 as p — 1, and thus y,(f) = 0.5 
ġ (t). For p < 1, the strength of the demodulated interference essentially depends 
on the peak-to-peak value 


ay, = alp, 0) ~ alp, T) = 2p/(1 — p?) 


which is plotted versus p in Fig. 5.4—65. This knee-shaped curve reveals that if 
transmission fading causes p to vary around a nominal value of about 0.7, the inter- 
ference almost disappears when p « 0.7 whereas it takes over and "captures" the 
output when p 0.7. 

Panter (1965, Chap. 11) presents a detailed analysis of FM interference, includ- 
ing waveforms that result when both carriers are modulated. 
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Figure 5.4-6 


5.5 PROBLEMS 


Sketch and label $(f) and f(t) for PM and FM when x(t) = AA(t/). Take 
$( — oo) = 0 in the FM case. 


Do Prob. 5.1-1 with x(t) = Acos(qrt/7)1I(t/27). 
4At 
Do Prob. 5.1-1 with x(t) = aya t> 4, 


-A frequency-sweep generator produces a sinusoidal output whose instantaneous fre- 


quency increases linearly from f, at t = 0 to f; att = T. Write 6,(t) forO s t T. 


Besides PM and FM, two other possible forms of exponential modulation are phase- 
integral modulation, with $(t) = K dx(t)/dt, and phase-acceleration modulation, 
with 


t 


f =f + r| x(A)dA 


Add these to Table 5.1—1 and find the maximum values of #(t) and f(t) for all four 
types when x (t) = cos 27 fnt. 


Use Eq. (16) to obtain Eq. (18a) from Eq. (15). 


Derive Eq. (16) by finding the exponential Fourier series of the complex periodic 
function exp (j B sin o, t). 


Tone modulation is applied simultaneously to a frequency modulator and a phase 
modulator and the two output spectra are identical. Describe how these two spectra 
will change when: (a) the tone amplitude is increased or decreased; (5) the tone fre- 
quency is increased or decreased; (c) the tone amplitude and frequency are increased 
or decreased in the same proportion. 
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Consider a tone-modulated FM or PM wave with f,, = 10 kHz, B = 2.0, A, = 100, 


and f, = 30 kHz. (a) Write an expression for f(t). (b) Draw the line spectrum and 
show therefrom that Sy < A2/2. 


Do Prob. 5.1-9 with f, = 20 kHz and f, = 40 kHz, in which case Sy > A2/2. 


Construct phasor diagrams for tone-modulated FM with A, = 10 and B = 0.5 when 
cy, = 0, 7/4, and 7/2. Calculate A and $ from each diagram and compare with the 
theoretical values. 


Do Prob. 5.1-11 with B — 1.0. 


A tone-modulated FM signal with B = 1.0 and fẹ, = 100 Hz is applied to an ideal 
BPF with B — 250 Hz centered at f, — 500. Draw the line spectrum, phasor dia- 
gram, and envelope of the output signal. 


Do Prob. 5.1-13 with B — 5.0. 


One implementation of a music synthesizer exploits the harmonic structure of FM 
tone modulation. The violin note C; has a frequency of f; = 405 Hz with harmon- 
ics at integer multiples of f; when played with a bow. Construct a system using FM 
tone modulation and frequency converters to synthesize this note with fy and three 
harmonics. 


Consider FM with periodic square-wave modulation defined by x(t) = 1 for 
O<t< 7/2 and x(t) = — 1 for — Ty/2 < t < 0. (a) Take $(0) = 0 and plot 
f(t) for — To/2 < t < To/2. Then use Eq. (20a) to obtain 

£) em 


Ls n+ ; nc 
e. Pert sino( 5 3 em 4 sinc 


where B = fT). (b) Sketch the resulting magnitude line spectrum when £ is a large 
integer. 








A message has W = 15 kHz. Estimate the FM transmission bandwidth for fa = 0.1, 
0.5, 1, 5, 10, 50, 100, and 500 kHz. 


Do Prob. 5.2-1 with W = 5 kHz. 


An FM system has fa = 10 kHz. Use Table 9.4—1 and Fig. 5.2-1 to estimate the 
bandwidth for: (a) barely intelligible voice transmission; (5) telephone-quality voice 
transmission: (c) high-fidelity audio transmission. 


A video signal with W = 5 MHz is to be transmitted via FM with fa = 25 MHz. Find 
the minimum carrier frequency consistent with fractional bandwidth considerations. 
Compare your results with transmission via DSB amplitude modulation. 


Your new wireless headphones use infrared FM transmission and have a frequency 
response of 30-15,000 Hz. Find B, and f, consistent with fractional bandwidth con- 
siderations, assuming f. = 5 x 10'^ Hz. 


| 
i 
i 
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A commercial FM radio station alternates between music and talk show/call-in for- 
mats. The broadcasted CD music is bandlimited to 15 kHz based on convention. 
Assuming D — 5 is used for both music and voice, what percentage of the available 
transmission bandwidth is used during the talk show if we take W = 5 kHz for 
voice signals? 


An FM system with f, = 30 kHz has been designed for W = 10 kHz. Approxi- 
mately what percentage of B; is occupied when the modulating signal is a unit- 


amplitude tone at f, = 0.1, 1.0, or 5.0 kHz? Repeat your calculations for a PM 
system with $4 = 3 rad. | 


Consider phase-integral and phase-acceleration modulation defined in Prob. 5.1-5. 
Investigate the bandwidth requirements for tone modulation, and obtain transmis- 
sion bandwidth estimates. Discuss your results. 


The transfer function of a single-tuned BPF is E( f) ~ 1/[1 + j2Q (f — f£J/f.] 
over the positive-frequency passband. Use Eq. (10) to obtain an expression for the 
output signal and its instantaneous phase when the input is an NBPM signal. 


Use Eq. (10) to obtain an expression for the output signal and its amplitude when 
an FM signal is distorted by a system having H(f) = Kg — K(f — f Y over the 
positive-frequency passband. 


Use Eq. (13) to obtain an expression for the output signal and its instantaneous 
frequency when an FM signal is distorted by a system having |H(f)| = 1 and 
arg H(f) = a,(f — f£.) + alf — f. over the positive-frequency passband. 


An FM signal is applied to the BPF in Prob. 5.2-9. Let a = 20f,/f, << 1 and use 
Eq. (13) to obtain an approximate expression for the output signal and its instanta- 
neous frequency. 


Let the input to the system in Fig. 5.2-6a be an FM signal with D = f,/W and spu- 
rious amplitude variations. Sketch the spectrum at the output of the limiter and show 
that successful operation requires fa < (f. — W)/2. 


The input to the system in Fig. 5.2-6b is an FM signal with D = fA/W and the BPF 
is centered at 3f,, corresponding to a frequency tripler. Sketch the spectrum at the fil- 


ter's input and obtain a condition on fa in terms of f, and W that ensures successful 
operation. 


Do Prob. 5.2-14 with the BPF centered at 4f, corresponding to a frequency 
quadrupler. 


The equivalent tuning capacitance in Fig. 5.3-1 is C(t) ^ Ci + C,(¢) where 
C,(t) = C/V Vg + x(t)/N. Show that C(t) = Cy — C,(t) with 1 percent accuracy 
if NV; = 300/4. Then show that the corresponding limitation on the frequency 
deviation is fA < f./300. 


The direct FM generator in Fig. 5.3-2 is used for a remote-controlled toy car. Find 
the range of allowable values for W so that B; satisfies the fractional bandwidth 
requirements, assuming the maximum frequency deviation of 150 KHz is used. 
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Confirm that x(t) = A.cos 6,(t) is a solution of the integrodifferential equation 
x(t) = —6(t) f 0t) x(t) dt. Then draw the block diagram of a direct FM gener- 
ator based on this relationship. 


Suppose an FM detector receives the transmitted signal that was generated by the 
phase modulator in Fig. 5.3-3. Describe the distortion in the output message signal. 
(Hint: Consider the relationship between the message signal amplitude and fre- 
quency, and the modulation index.) 


An audio message signal is transmitted using frequency modulation. Describe the 
distortion on the output message signal if it is received by a PM detector. (Hint: 
Consider the relationship between the message signal amplitude and frequency, and 
the modulation index.) 


Design a wireless stereo speaker system using indirect FM. Assuming W = 15 kHz, 
D = 5, fa = 500 kHz, f. = 915 MHz, and ¢,/27T < 20, determine the number 
of triplers needed in your multiplier stage, and find the value of f; needed to design 
your system. 


The audio portion of a television transmitter is an indirect FM system having 
W = 10 kHz, D = 2.5, and f, = 4.5 MHz. Devise a block diagram of this system 
with $4/2«T < 20 Hz and f, = 200 kHz. Use the shortest possible multiplier 
chain consisting of frequency triplers and doublers, and locate the down-converter 
such that no frequency exceeds 100 MHz. 


A signal with W = 4 kHz is transmitted using indirect FM with f. = 1 MHz and 
fa = 12 KHz. If Q,/27T < 100 and fa = 10 kHz, how many doublers will be 
needed to achieve the desired output parameters? Draw the block diagram of the 
system indicating the value and location of the local oscillator such that no fre- 
quency exceeds 10 MHz. 


Suppose the phase modulator in Fig. 5.3-5 is implemented as in Fig. 5.3-3. Take 
x(t) ^ Am cos o, t and let B = ($A/24T)(A,/f,.). (a) Show that if B < 1, then 


fit) 9 fe + Bf [cos o, t + (B/2) cos 3o, t] 


(b) Obtain a condition on $4/21T so the third-harmonic distortion does not exceed 
1 percent when A,, = 1 and 30 Hz < f,, = 15 kHz, as in FM broadcasting. 


Let the input to Fig. 5.3—7a be an FM signal with fa << f, and let the differentiator 
be implemented by a tuned circuit with H(f) = 1/[1 + j(2Q/fo)(f — fo)] for 
f = fo. Use the quasi-static method to show that yp(t) ~ Kpfa x(t) when fy = f. + b 
provided that fy < b << fy/2Q. 


Let the input to Fig. 5.3—7a be an FM signal with fa < f. and let the differentiator 
be implemented by a first-order lowpass filter with B = f.. Use quasi-static analysis 
to show that yp(t) = —K;fax(t) + Kf 4x (t). Then take x(t) = cos œt and obtain 
a condition on f4/f. so the second-harmonic distortion is less than 1%. 
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The tuned circuits in Fig. 5.3-8b have transfer functions of the form H(f) = 
IEL + INNS — fo] for f = fo). Let the two center frequencies be f; = f, = b with 
fa € b <f, Use quasi-static analysis to show that if both circuits have (2Q/fp)b = 
a «& 1, then yp(t) ^ K,x(t) — Kax?(t) where K4/K, «& 1. 


Obtain an approximate expression for the output of an amplitude demodulator when 
the input is an AM signal with 100 percent modulation plus an interfering signal 
A(1 + x(t)] cos [(o, + w;)t + $;] with p = A//A, « 1. Is the demodulated 
interference intelligible? 


Obtain an approximate expression for the output of a phase demodulator when the 
input is an NBPM signal with 100 percent modulation plus an interfering signal 


A; cos [(w, + w)t + $1] with p = A/A, << 1. Is the demodulated interference 
intelligible? 


Investigate the performance of envelope detection versus synchronous detection of 
AM in the presence of multipath propagation, so that v(t) = x(t) + ax,(t — t4) 
with o? < 1. Consider the special cases «f; ^ 7/2 and «t; ^ ar. 


You are talking on your cordless phone, which uses amplitude modulation, when 
someone turns on a motorized appliance, causing static on the phone. You switch to 
your new FM cordless phone, and the call 1s clear. Explain. 


In World War II they first used preemphasis/deemphasis in amplitude modulation 
for mobile communications to make the high-frequency portion of speech signals 
more intelligible. Assuming that the amplitude of the speech spectrum is bandlimited 
to 3.5 KHz and rolls off at about 6 dB per decade (factor of 10 on a log-frequency 
scale) above 500 Hz, draw the Bode diagrams of the preemphasis and deemphasis fil- 
ters so that the message signal has a flattened spectrum prior to transmission. Discuss 
the impact on the transmitted power for DSB versus standard AM with u = 1. 


Preemphasis filters can also be used in hearing aid applications. Suppose a child has 
a hearing loss that gets worse at high frequencies. A preemphasis filter can be 
designed to be the approximate inverse of the high frequency deemphasis that takes 
place in the ear. In a noisy classroom it is often helpful to have the teacher speak into 
a microphone and have the signal transmitted by FM to a receiver that the child is 
wearing. Is it better to have the preemphasis filter at the microphone end prior to FM 
transmission or at the receiver worn by the child? Discuss your answer in terms of 
transmitted power, transmitted bandwidth, and susceptibility to interference. 


A message signal x(t) has an energy or power spectrum that satisfies the condition 
Gf) - (Ba Jf) G max £l > Bae 


where Gmax is the maximum of G,(f) in | f| < Ba- If the preemphasis filter in Eq. 
(7) is applied to x(t) before FM transmission, will the transmitted bandwidth be 
increased? 
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Equation (8) also holds for the case of unmodulated adjacent-channel interference if 
we let $(t) = w;t. Sketch the resulting demodulated waveform when p = 0.4, 0.8, 
and 1.2. 


If the amplitude of an interfering sinusoid and the amplitude of the sinusoid of inter- 
est are approximately equal, p = A,/A, ~ 1 and Eq. (8b) appears to reduce to 
a(p, $) = 1/2 for all ¢;, resulting in cross talk. However, large spikes will appear 
at the demodulator output when $, = +77. Show that if $; = m and p = 1 + e, 
then a(p, T) — «oo as € > 0. Conversely, show that if p is slightly less than 1 and 
pi= m + e, thena(p, $j) > — case 0. 


Develop an expression for the demodulated signal when an FM signal with instanta- 
neous phase (t) has interference from an unmodulated adjacent-channel carrier. 
Write your result in terms of ¢(t), p = A/A,, and 0(t) = wit + Q; 


d 
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F data and mathematical functions are frequently displayed as continuous curves, even though a finite 
number of discrete points was used to construct the graphs. If these points, or samples, have sufficiently close spac- 
ing, a smooth curve drawn through them allows you to interpolate intermediate values to any reasonable degree of 
accuracy. It can therefore be said that the continuous curve is adequately described by the sample points alone. 

In similar fashion, an electric signal satisfying certain requirements can be reproduced from an appropriate set 
of instantaneous samples. Sampling therefore makes it possible to transmit a message in the form of pulse modulation, 
rather than a continuous signal. Usually the pulses are quite short compared to the time between them, so a pulse- 
modulated wave has the property of being "off" most of the time. 

This property of pulse modulation offers two potential advantages over CW modulation. First, the transmitted 
power can be concentrated into short bursts instead of being generated continuously. The system designer then has 
greater latitude for equipment selection, and may choose devices such as lasers and high-power microwave tubes 
that operate only on a pulsed basis. Second, the time interval between pulses can be filed with sample values from 
other signals, a process called time-division multiplexing (TDM). 

But pulse modulation has the disadvantage of requiring very large transmission bandwidth compared to the mes- 
sage bandwidth. Consequently, the methods of analog pulse modulation discussed in this chapter are used primarily 
as message processing for TDM and/or prior to CW modulation. Digital or coded pulse modulation has additional 
advantages that compensate for the increased bandwidth, as we'll see in Chapter 12. 


OBJECTIVES 


After studying this chapter and working the exercises, you should be able to do each of the following: 


l. Draw the spectrum of a sampled signal (Sect. 6.1). 


2. Define the minimum sampling frequency to adequately represent a signal given the maximum value of aliasing 
error, message bandwidth, LPF characteristics, and so forth (Sect. 6.1). 


Know what is meant by the Nyquist rate and know where it applies (Sect. 6.1). 
Describe the implications of practical sampling versus ideal sampling (Sect. 6.1). 
Reconstruct a signal from its samples using an ideal LPF (Sect. 6.1). 


Explain the operation of pulse-amplitude modulation, pulse-duration modulation, and pulse-position modulation; 
sketch their time domain waveforms; and calculate their respective bandwidths (Sects. 6.2 and 6.3). 


San Pr & 


6.1 SAMPLING THEORY AND PRACTICE 


The theory of sampling presented here sets forth the conditions for signal sampling 
and reconstruction from sample values. We'll also examine practical implementa- 
tion of the theory and some related applications. 


Chopper Sampling 


A simple but highly informative approach to sampling theory comes from the 
switching operation of Fig. 6.1-1a. The switch periodically shifts between two con- 
tacts at a rate of f, = 1/7, Hz, dwelling on the input signal contact for 7 seconds and 
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on the grounded contact for the remainder of each period. The output x,(z) then con- 
sists of short segments for the input x(¢), as shown in Fig. 6.1-15. Figure 6.1-1c is 
an electronic version of Fig. 6.1-1a; the output voltage equals the input voltage 
except when the clock signal forward-biases the diodes and thereby clamps the out- 
put to zero. This operation, variously called single-ended or unipolar chopping, is 
not instantaneous sampling in the strict sense. Nonetheless, x(t) will be designated 
the sampled wave and f, the sampling frequency. 

We now ask: Are the sampled segments sufficient to describe the original input 
signal and, if so, how can x(t) be retrieved from ~x,(t)? The answer to this question 
lies in the frequency domain, in the spectrum of the sampled wave. 

As a first step toward finding the spectrum, we introduce a switching function 
s(t) such that 


x,(t) = x(2)s(t) m 


Thus the sampling operation becomes multiplication by s(t), as indicated schemat- 
ically in Fig. 6.1—2a, where s(£) is nothing more than the periodic pulse train of 
Fig. 6.1-2b. Since s(t) is periodic, it can be written as a Fourier series. Using the 
results of Example 2.1-1 we have 





s(t) — 3 fir sinc nfr eJ?7"55 = qo + Y 26, cos nw,t [2] 
n=—co n=l 
where 
c, = f,T sinc nf,T w, = 2mf, 
ld 
Figure 6.1-1 Switching sampler. (a) Functional diagram; [b] waveforms; (c] circuit realiza- 


tion with diode bridge. 
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This analysis has shown that if a bandlimited signal is sampled at a frequency 
greater than the Nyquist rate, it can be completely reconstructed from the sampled 
wave. Reconstruction is accomplished by lowpass filtering. These conclusions may 
be difficult to believe at first exposure; they certainly test our faith in spectral analy- 
sis. Nonetheless, they are quite correct. 

Finally, it should be pointed out that our results are independent of the sample- 
pulse duration, save as it appears in the duty cycle. If r is made very small, x(ġ 
approaches a string of instantaneous sample points, which corresponds to ideal 
sampling. We'll pursue ideal sampling theory after a brief look at the bipolar chop- 
per, which has 7 = T2. 


Figure 6.1—4a depicts the circuit and waveforms for a bipolar chopper. The equiva- 
lent switching function is a square wave alternating between s(t) = +1 and —1. 
From the series expansion of s( we get 


x(t) = = 0 COS wt — E o cos 3w, t + + o cos SO bom [71 
whose spectrum is sketched in Fig. 6.1—4b for f = 0. Note that X(f) contains no dc 
component and only the odd harmonics of f,. Clearly, we can't recover x(t) by low- 
pass filtering. Instead, the practical applications of bipolar choppers involve band- 
pass filtering. 

If we apply x,(f) to a BPF centered at some odd harmonic nf, the output will be 
proportional to x(f) cos nw,t—-a double-sideband suppressed-carrier waveform. 


x(t) 





Xs) 


0 f 3f, 5f, 
(b) 


Figure 6.1-4 Bipolar chopper. {a} Circuit and waveforms; (b] spectrum. 
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Thus, a bipolar chopper serves as a balanced modulator. It also serves as a synchro- 
nous detector when the input is a DSB or SSB signal and the output is lowpass fil- 
tered. These properties are combined in the chopper-stabilized amplifier, which 
makes possible dc and low-frequency amplification using a high-gain ac amplifier. 
Additionally, Prob. 6.1—4 indicates how a bipolar chopper can be modified to pro- 
duce the baseband multiplexed signal for FM stereo. 


Ideal Sampling and Reconstruction 


By definition, ideal sampling is instantaneous sampling. The switching device of 
Fig. 6.1-1a yields instantaneous values only if 7 — 0; but then f;7 — 0, and so does 
x(t). Conceptually, we overcome this difficulty by multiplying x,(f) by 1/7 so that, 
as T > 0 and 1/r — co, the sampled wave becomes a train of impulses whose areas 
equal the instantaneous sample values of the input signal. Formally, we write the 


rectangular pulse train as 
= t — kT, 
s(t) = X u( : ) 


from which we define the ideal sampling function 





1 co 
s(t) 2 lim = s(t) = Y a(t — kr) [8] 
T9, k=- 
The ideal sampled wave is then 
Xa(t) = x(t)ss(t) [9a] 
= x(t) >) à(t — kT,) 
k=—oo 
= M x(kT,) lt — kT;) [9b] 
k=- 


since x(t) 8(t — kT,) = x(kT,) &(t — kT,). 

To obtain the corresponding spectrum X(f) = #[x5(t)] we note that (1/7)x,(2) = 
Xs(t) as T — O and, likewise, (1/7)X,(f) —> X(f). But each coefficient in Eq. (4) has 
the property c,/7 = f, sinc nf,r = f, when 7 = 0. Therefore, 


Xf) =AXCA) + AUX 5) + Xf + 5)) + 
=f > X(f — nf, 


n--oo 


[10] 


which is illustrated in Fig. 6.1—5 for the message spectrum of Fig. 6.1—3a taking f, > 
2W. We see that X;( f) is periodic in frequency with period f,, a crucial observation 
in the study of sampled-data systems. 
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Xf) 


Figure 6.1-5 Spectrum of ideally sampled message. 


Somewhat parenthetically, we can also develop an expression for S;(f) = 
#([ss(t)] as follows. From Eq. (9a) and the convolution theorem, X;(f) = X(f) * 
Ss(f) whereas Eq. (10) is equivalent to 


X) = xi) «| , AF nt) 
Therefore, we conclude that 
Sf) =f, > OF — nf) m 


so the spectrum of a periodic string of unit-weight impulses in the time domain is a 
periodic string of impulses in the frequency domain with spacing f, = 1/7,; in both 
domains we have a function that looks like a picket fence. 

Returning to the main subject and Fig. 6.1—5, it's immediately apparent that if 
we invoke the same conditions as before—x(1) bandlimited in W and f, = 2W—then 
a filter of suitable bandwidth will reconstruct x(t) from the ideal sampled wave. 
Specifically, for an ideal LPF of gain K, time delay ta, and bandwidth B, the transfer 
function is 


H(f) = kn(Z Jes 
so filtering x&(t) produces the output spectrum 
Y(f) = R(X&(f) = Kf Xf)e 7 
assuming B satisfies Eq. (6). The output time function is then 
y(t) = 9 (Y0)] = Kfx(t — ta) (12] 


which is the original signal amplified by Kf, and delayed by t,. 
Further confidence in the sampling process can be gained by examining recon- 
struction in the time domain. The impulse response of the LPF is 


h(t) = 2BK sinc 2B(t — t4) 


—— 
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And since the input x;(t) is a train of weighted impulses, the output is a train of 
weighted impulse responses, namely, 


y(t) 


A(t) * x(t) = >) x(kT,)A(t — kT;) [13] 


k 


= 2BK Y, x(kT;) sinc 2B(t — t, — kT;) 


k——oo 


Now suppose for simplicity that B = f//2, K = 1/f,, and t, = 0, so 


y(t) = 2,1.) sinc (f,t — k) 

We can then carry out the reconstruction process graphically, as shown in Fig. 6.1-6. 
Clearly the correct values are reconstructed at the sampling instants t = kT,, for all 
sinc functions are zero at these times save one, and that one yields x(kT,). Between 
sampling instants x(t) is interpolated by summing the precursors and postcursors 
from all the sinc functions. For this reason the LPF is often called an interpolation 
filter, and its impulse is called the interpolation function. 

The above results are well summarized by stating the important theorem of uni- 
form (periodic) sampling. While there are many variations of this theorem, the fol- 
lowing form is best suited to our purposes. 





Another way to express the theorem comes from Eqs. (12) and (13) with K = T, and 
tz = 0. Then y(t) = x(t) and 


x(t) = 2BT, >) x(kT,) sinc 2B(t — kT,) [14] 
k--—co 
x(3T,) yO = x) 


x(3T,) sinc (f,t - 3). 








Figure 6.1-6 Ideal reconstruction. 
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provided T, = 1/2W and B satisfies Eq. (6). Therefore, just as a periodic signal is 
completely described by its Fourier series coefficients, a bandlimited signal is com- 


pletely described by its instantaneous sample values whether or not the signal actu- 
ally is sampled. 





EXERCISE 6.1-1 Consider a sampling pulse train of the general form 


s(t) = Y plt — KE) [15a] 


kz —oo 


whose pulse type p(?) equals zero for |t| > 7/2 but is otherwise arbitrary. Use an 
exponential Fourier series and Eq. (21), Sect. 2.2, to show that 


S) =f Xi P(nf) 8 — nf) Lib] 


n---oo 


where P(f) = &[p(t)). Then let p(t) = 8(?) to obtain Eq. (11). 





Practical Sampling and Aliasing 


Practical sampling differs from ideal sampling in three obvious aspects: 


1. The sampled wave consists of pulses having finite amplitude and duration, 
rather than impulses. 


Practical reconstruction filters are not ideal filters. d 


3. The messages to be sampled are timelimited signals whose spectra are not and 
cannot be strictly bandlimited. 





The first two differences may present minor problems, while the third leads to the 
more troublesome effect known as aliasing. 

Regarding pulse-shape effects, our investigation of the unipolar chopper and the 
results of Exercise 6.1—1 correctly imply that almost any pulse shape p( will do 
when sampling takes the form of a multiplication operation x(t)s, (f). Another opera- 
tion produces flat-top sampling described in the next section. This type of sampling 
may require equalization, but it does not alter our conclusion that pulse shapes are 
relatively inconsequential. 

Regarding practical reconstruction filters, we consider the typical filter 
response superimposed in a sampled-wave spectrum in Fig. 6.1—7. As we said ear- 
lier, reconstruction can be done by interpolating between samples. The ideal LPF 
does a perfect interpolation. With practical systems, we can reconstruct the signal 
using a zero-order hold (ZOH) with 


X0 = X«en( 75) T 


k s 
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X) 


Filter response 





Figure 6.1-7 Practical reconstruction filter. 


or a first-order hold (FOH) which performs a linear interpolation using 


y(t) = EGONG 2: [17] 


k s 





The reconstruction process for each of these is shown in Fig. 6.1-8. Both the ZOH 
and FOH functions are lowpass filters with transfer function magnitudes of 

IHzog C] = |T, sinc (FT| and |Hpou (£)| = |Z, VA + (27; sinc'(f7;) |, respec- 
tively. See Problems 6.1-11 and 6.1-12 for more insight. 

If the filter is reasonably flat over the message band, its output will consist of x(7) 
plus spurious frequency components at |f| > f, — W outside the message band. In 
audio systems, these components would sound like high-frequency hissing or "noise." 
However, they are considerably attenuated and their strength is proportional to x(z), so 
they disappear when x(t) = 0. When x(t) # 0, the message tends to mask their pres- 
ence and render them more tolerable. The combination of careful filter design and an 
adequate guard band created by taking f, > 2W makes practical reconstruction filter- 
ing nearly equivalent to ideal reconstruction. In the case of ZOH and FOH reconstruc- 
tion, their frequency response shape sinc(f7,) and sinc? (fT) will distort the spectra of 
x(t). We call this aperature error, which can be minimized by either increasing the 
sampling rate or compensating with the appropriate inverse filter. 





XEOH (t) 


A Xzon (f) 








(a) (b) 


Figure 6.1-8 Signal reconstruction from samples using (a) ZOH, (b) FOH. 
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Regarding the timelimited nature of real signals, a message spectrum like 
Fig. 6.1-9a may be viewed as a bandlimited spectrum if the frequency content above 
W is small and presumably unimportant for conveying the information. When such a 
message is sampled, there will be unavoidable overlapping of spectral components as 
shown in Fig. 6.1—9b. In reconstruction, frequencies originally outside the normal 
message band will appear at the filter output in the form of much lower frequencies. 
Thus, for example, f, > W becomes f, — f, « W, as indicated in the figure. 

This phenomenon of downward frequency translation is given the descriptive 
name of aliasing. The aliasing effect is far more serious than spurious frequencies 
passed by nonideal reconstruction filters, for the latter fall outside the message band, 
whereas aliased components fall within the message band. Aliasing is combated by 
filtering the message as much as possible before sampling and, if necessary, sam- 
pling at higher than the Nyquist rate. This is often done when the antialiasing filter 
does not have a sharp cutoff characteristic, as is the case of RC filters. Let's consider 
a broadband signal whose message content has a bandwidth of W but is corrupted by 
other frequency components such as noise. This signal is filtered using the simple 
first-order RC LPF antialiasing filter that has bandwidth B = 1/27RC with W << B 
and is shown in Figure 6.1—9a. It is then sampled to produce the spectra shown in 
Fig. 6.1-9b. The sbaded area represents the aliased components that have spilled 
into the filter's passband. Observe that the shaded area decreases if f, increases or if 
we employ a more ideal LPF. Assuming reconstruction is done with the first-order 
Butterworth LPF, the maximum percent aliasing error in the passband is 


Cf») 











Figure 6.1-9 Message spectrum. [a] Output of RC filter; (b) after sampling. Shaded area 
represents aliasing spillover into passband. 
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1/0.707 
V1-(f£,/B) 


with f, = f, — B and the 0.707 factor is due to the filter's gain at its half-power fre- 
quency, B. See Ifeachor and Jervis (1993). 


Error9o — x 10096 [18] 





Oversampling EXAMPLE 6.1-2 


When using VLSI technology for digital signal processing (DSP) of analog signals, 
we must first sample the signal. Because it is relatively difficult to fabricate inte- 
grated circuit chips with large values of R and C we use the most feasible RC LPF 
and then oversample the signal at several times its Nyquist rate. We follow with a 
digital filter to reduce frequency components above the information bandwidth W. 
We then reduce the effective sampling frequency to its Nyquist rate using a process 
called downsampling. Both the digital filtering and downsampling processes are 
readily done with VLSI technology. 

Let's say the maximum values of R and C we can put on a chip are 10 kQ and 
100 pF, respectively, and we want to sample a telephone quality voice such that the 


aliased components will be at least 30 dB below the desired signal. Using Eq. (18) 
with 


1 1 
Bis a ÁÉÁRSIÓ (n aÓ 
WRC 2r x 10* x 100-2 i 
we get 
1/0.707 
Sepe [oco ist TD: 


V1 + (£/159 kHz)? 


Solving yields f, = 4.49 MHz, and therefore the sampling frequency is f, = fa + 
B = 4.65 MHz. With our RC LPF, and f, = 4.49 MHZ, any aliased components at 
159 KHz will be at least 5 percent below the signal level at the half-power frequency. 
Of course the level of aliasing will be considerably less than 5 percent at frequencies 
below the telephone bandwidth of 3.2 KHz. 





Sampling Oscilloscopes l EXAMPLE 6.1-3 


A practical application of aliasing occurs in the sampling oscilloscope, which 
exploits undersampling to display high-speed periodic waveforms that would other- 
wise be beyond the capability of the electronics. To illustrate the principle, consider 
the periodic waveform x(£) with period T, = 1/f, in Fig. 6.1-10a. If we use a sam- 
pling interval T, slightly greater than 7, and interpolate the sample points, we get the 
expanded waveform y(t) = x(at) shown as a dashed curve. The corresponding sam- 
pling frequency is 


{<= Cay, 0<a<il 
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y(t) = x(at) 


f. f 0 f 2f, f 
| tb 
~2f5 -fs 4,0 f Ís 2f d 
(d 
Figure 6.1-10 (a) Periodic waveform with undersampling; (b) spectrum of x(t); (c] spectrum 


of y(t) = x(a), a«1. 


so f, < f, and even the fundamental frequency of x(t) will be undersampled. Now let's 
find out if and how this system actually works by going to the frequency domain. 
We assume that x(t) has been prefiltered to remove any frequency components 
higher than the mth harmonic. Figure 6.1-10b shows a typical two-sided line spec- 
trum of x(t), taking m = 2 for simplicity. Since sampling translates all frequency 


components up and down by nf, the fundamental will appear in the spectrum of the 
sampled signal at 


tp stu eat 


as well as at +f, and at f, + nf, = (1 + n)f, + nf,. Similar translations applied to the 
dc component and second harmonic yield the spectrum in Figure 6.1-10c, which 
contains a compressed image of the original spectrum centered at each multiple of f. 


Therefore, a lowpass filter with B = f/2 will construct y(t) = x(at) from x,(t) pro- 
vided that 





« 
2m + 1 


which prevents spectral overlap. 





tl 
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Demonstrate the aliasing effect for yourself by making a careful sketch of cos 27:10t 
and cos 2770tfor0 StS $. Put both sketches on the same set of axes and find the 
sample values at t = 0, $, á... E» which corresponds to f, = 80. Also, convince 
yourself that no other waveform bandlimited in 10 < W < 40 can be interpolated 
from the sample values of cos 2710t. 


EXERCISE 6.1-2 





6.2  PULSE-AMPLITUDE MODULATION 


If a message waveform is adequately described by periodic sample values, it can be 
transmitted using analog pulse modulation wherein the sample values modulate the 
amplitude of a pulse train. This process is called pulse-amplitude modulation 
(PAM). An example of a message waveform and corresponding PAM signal are 
shown in Figure 6.2-1. 

As Figure 6.2-1 indicates, the pulse amplitude varies in direct proportion to the 
sample values of x(t). For clarity, the pulses are shown as rectangular and their dura- 
tions have been grossly exaggerated. Actual modulated waves would also be 
delayed slightly compared to the message because the pulses can't be generated 
before the sampling instances. 

It should be evident from the waveform that a PAM signal has significant dc con- 
tent and that the bandwidth required to preserve the pulse shape far exceeds the mes- 
sage bandwidth. Consequently you seldom encounter a single-channel communica- 
tion system with PAM or, for that matter, other analog pulse-modulated methods. But 
analog pulse modulation deserves attention for its major roles in time-division multi- 
plexing, data telemetry, and instrumentation systems. 


Flat-Top Sampling and PAM 


Although a PAM wave could be obtained from a chopper circuit, a more popular 
method employs the sample-and-hold (S/H) technique. This operation produces 
flat-top pulses, as in Fig. 6.2-1, rather than curved-top chopper pulses. We therefore 
begin here with the properties of flat-top sampling. 
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Figure 6.2-1 
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A rudimentary S/H circuit consists of two FET switches and a capacitor, con- 
nected as shown in Fig. 6.2—2a. A gate pulse at G1 briefly closes the sampling 
switch and the capacitor holds the sampled voltage until discharged by a pulse 
applied to G2. (Commercial integrated-circuit S/H units have further refinements, 
including isolating op-amps at input and output). Periodic gating of the sample-and- 
hold circuit generates the sampled wave 


xp(t) = M x(kT;)p(t — kT;) [1] 
k 
illustrated by Fig. 6.2-2b. Note that each output pulse of duration 7 represents a sin- 
gle instantaneous sample value. 
To analyze flat-top sampling, we'll draw upon the relation p(t — kT,) = p(t) * 
8 (t — kT,) and write 


1) = p ST) 8 - n) | = p «x40 


Fourier transformation of this convolution operation yields 


x4) = P0) | XU - mf) | = POXA) T 


Figure 6.2—3 provides a graphical interpretation of Eq. (2), taking X(f) = II(f/2W). 
We see that flat-top sampling is equivalent to passing an ideal sampled wave 
through a network having the transfer function P(f) = Fp]. 

The high-frequency rolloff characteristic of a typical P(f) acts like a lowpass 
filter and attenuates the upper portion of the message spectrum. This loss of high- 
frequency content is called aperture effect. The larger the pulse duration or aper- 
ture 7, the larger the effect. Aperture effect can be corrected in reconstruction by 
including an equalizer with 


Half) = Ke PCF) [3] 


However, little if any equalization is needed when T/T, << 1. 


Sampling Discharge x(t) x(kT;) 
switch switch 


kT, 





(b) 


Figure 6.2-2 Fla-top sampling. {a} Sample-and-hold circuit; (b) waveforms. 
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X(f) 


-fs 0 Ís 
(a) 
IX Cf) 


IPC f) 





(b) 


Figure 6.2-3 (a) Spectrum for ideal sampling when X(f) = TI[f/2W); (b) aperture effect in 
flattop sampling. 


Now consider a unipolar flat-top PAM signal defined by : 


x(t) = M A1 + ux(kT;)]p(t — KT;) [4] 


The constant A, equals the unmodulated pulse amplitude, and the modulation index 
i. controls the amount of amplitude variation. The condition 


1+ uxt) D [5] 


ensures a unipolar (single-polarity) waveform with no missing pulses. The resulting 
constant pulse rate f, is particularly important for synchronization in time-division 
multiplexing. 

Comparison of Eqs. (1) and (4) shows that a PAM signal can be obtained from 
a sample-and-hold circuit with input Ao[1 + ux(A]. Correspondingly, the PAM 
spectrum will look like Fig. 6.2-3b with X(f) replaced by 


SRAd[1 + ux()]) = Aol df) + uXCO)). 


which results in spectral impulses at all harmonics of f, and at f = 0. Reconstruction of 
x(t) from x (f) therefore requires a dc block as well as lowpass filtering and equalization. 

Clearly, PAM has many similarities to AM CW modulation—modulation 
index, spectral impulses, and dc blocks. (In fact, an AM wave could be derived from 
PAM by bandpass filtering). But the PAM spectrum extends from dc up through sev- 
eral harmonics of f,, and the estimate of required transmission bandwidth B; must be 
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based on time-domain considerations. For this purpose, we assume a small pulse 
duration 7 compared to the time between pulses, so 


1 
«Tsai 
* 2W 


Adequate pulse resolution then requires 
l 


Hence, practical applications of PAM are limited to those situations in which the 
advantages of a pulsed waveform outweigh the disadvantages of large bandwidth. 





EXERCISE 6.2-1 Consider PAM b transmission ms a voice Aid with W~ =~ 23 kHz. Calculate Br iff, — 
8 KHz and 7 = 0.1 T, 











6.3 PULSE-TIME MODULATION 


The sample values of a message can also modulate the time parameters of a pulse 
train, namely the pulse width or its position. The corresponding processes are desig- 
nated as pulse-duration (PDM) and pulse-position modulation (PPM) and are illus- 
trated in Fig. 6.3-1. PDM is also called pulse-width modulation (PWM). Note the 
pulse width or pulse position varies in direct proportion to the sample values of x(t). 


Pulse-Duration and Pulse-Position Modulation 
We lump PDM and PPM together under one heading for two reasons. First, in both 
cases a time parameter of the pulse is being modulated, and the pulses have constant 


amplitude. Second, a close relationship exists between the modulation methods for 
PDM and PPM. 





Figure 6.3-1 Types of pulse-time modulation. 





6.3 Pulse-Time Modulation 


To demonstrate these points, Fig. 6.3-2 shows the block diagram and wave- 
forms of a system that combines the sampling and modulation operations for either 
PDM or PPM. The system employs a comparator and a sawtooth-wave generator 
with period T,. The output of the comparator is zero except when the message wave- 
form x(t) exceeds the sawtooth wave, in which case the output is a positive constant 
A. Hence, as seen in the figure, the comparator produces a PDM signal with trailing- 
edge modulation of the pulse duration. (Reversing the sawtooth results in leading- 
edge modulation while replacing the sawtooth with a triangular wave results in 
modulation on both edges.) Position modulation is obtained by applying the PDM 
signal to a monostable pulse generator that triggers on trailing edges at its input and 
produces short output pulses of fixed duration. 





Comparator 
dd PDM 
PPM 
M O 
Sawtooth onostable 
generator 


la] 
x() 





Figure 6.3-2 Generation of PDM or PPM. (a) Block diagram; {b} waveforms. 


Careful examination of Fig. 6.3—2b reveals that the modulated duration or posi- 
tion depends on the message value at the time location 1, of the pulse edge, rather 
than the apparent sample time kT, Thus, the sample values are nonuniformly spaced. 
Inserting a sample-and-hold circuit at the input of the system gives uniform sampling 
if desired, but there's little difference between uniform and nonuniform sampling in 
the practical case of small amounts of time modulation such that t, — KT, << T, 

If we assume nearly uniform sampling, the duration of the kth pulse in the PDM 
signal is 


ry = Tol + ux(kT,)] n1 
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in which the unmodulated duration 7, represents x(k7T,) = O and the modulation 
index 4 controls the amount of duration modulation. Our prior condition on w in 
Eq. (5), Sect. 6.2, applies here to prevent missing pulses and "negative" durations 
when x(kT,) <= 0. The PPM pulses have fixed duration and amplitude so, unlike 


PAM and PDM, there's no potential problem of missing pulses. The kth pulse in a 
PPM signal begins at time 


te = kT, + tg + ty x(KT;) [2] 


in which the unmodulated position kT, + t; represents x(kT,) = 0 and the constant to 
controls the displacement of the modulated pulse. 

The variable time parameters in Eqs. (1) and (2) make the expressions for x,(t) 
rather awkward. However, an informative approximation for the PDM waveform is 
derived by taking rectangular pulses with amplitude A centered at t = kT, and 
assuming that 7, varies slowly from pulse to pulse. Series expansion then yields 


co 
x(t) ~ Af, Toll + ux(t)] + > A sin no(t) cos nw,t [3] 
n-i 
where f(t) = «f,rg[l + px(t)]. Without attempting to sketch the corresponding 
spectrum, we see from Eq. (3) that the PDM signal contains the message x(t) plus a 
dc component and phase-modulated waves at the harmonics of f,. The phase modu- 
lation has negligible overlap in the message band when To << T, so x(t) can be 
recovered by lowpass filtering with a dc block. 

Another message reconstruction technique converts pulse-time modulation into 
pulse-amplitude modulation, and works for PDM and PPM. To illustrate this tech- 
nique the middle waveform in Fig. 6.3—3 is produced by a ramp generator that starts 
at time kT, stops at tą restarts at (k + 1)T,, and so forth. Both the start and stop 
commands can be extracted from the edges of a PDM pulse, whereas PPM recon- 
struction must have an auxiliary synchronization signal for the start command. 








kT, te +T, 


Figure 6.3-3 Conversion of PDM or PPM into PAM. 
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6.3 Pulse-Time Modulation 


Regardless of the particular details, demodulation of PDM or PPM requires 
received pulses with short risetime in order to preserve accurate message informa- 
tion. For a specified risetime f, << T, the transmission bandwidth must satisfy 


l 
Br = 21, [4] 
which will be substantially greater than the PAM transmission bandwidth. In 
exchange for the extra bandwidth, we gain the benefit of constant-amplitude pulses 
that suffer no ill effects from nonlinear distortion in transmission since nonlinear 
distortion does not alter pulse duration or position. 

Additionally, like PM and FM CW modulation, PDM and PPM have the poten- 
tial for wideband noise reduction—a potential more fully realized by PPM than by 
PDM. To appreciate why this is so, recall that the information resides in the time 
location of the pulse edges, not in the pulses themselves. Thus, somewhat like the 
carrier-frequency power of AM, the pulse power of pulse-time modulation is 
“wasted” power, and it would be more efficient to suppress the pulses and just trans- 
mit the edges! Of course we cannot transmit edges without transmitting pulses to 
define them. But we can send very short pulses indicating the position of the edges, 
a process equivalent to PPM. The reduced power required for PPM is a fundamental 
advantage over PDM, an advantage that becomes more apparent when we examine 
the signal-to-noise ratios. 


Derive Eq. (3) by the following procedure. First, assume constant pulse duration 7 
and write x,(t) = As(f) with s(t) given by Eq. (2), Sect. 6.1. Then apply the quasi- 
static approximation T = Toll + px(t)]. 


PPM Spectral Analysis* 


Because PPM with nonuniform sampling is the most efficient type of analog pulse 
modulation for message transmission, we should take the time to ADAE its spec- 
trum. The analysis method itself is worthy of examination. 

Let the kth pulse be centered at time ¢,. If we ignore the constant time delay t; in 
Eq. (2), nonuniform sampling extracts the sample value at t, rather than kT,, so 


t, = KT, + to x(t,) [5] 


By definition, the PPM wave is a summation of constant-amplitude position-modulated 
pulses, and can be written as 


z3 2,4pe — tj) = Ap(t) * | > ó(t — n| 


where A is the pulse amplitude and p(t) the pulse shape. A simplification at this 
point is made possible by noting that p(t) will (or should) have a very small duration 
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compared to T;. Hence, for our purposes, the pulse shape can be taken as impulsive, 
and 


x,(t) = A >) &(t — tj) [6] 


If desired, Eq. (6) can later be convolved with p(t) to account for the nonimpulsive 
shape. 

In their present form, Eqs. (5) and (6) are unsuited to further manipulation; the 
trouble is the position term ¢,, which cannot be solved for explicitly. Fortunately, t, 
can be eliminated entirely. Consider any function g(t) having a single first-order 
zero at f = A, such that g(A) = 0, g( # O fort # A, and g(t) # 0 att = A. The dis- 
tribution theory of impulses then shows that 


&(t — A) = {al 8(e(o)] [7] 


whose right-hand side is independent of A. Equation (7) can therefore be used to 
remove f, from 6(t — t,) if we can find a function g(f) that satisfies g(t) = O and the 
other conditions but does not contain £;. 


Suppose we take g(t) = t — kT, — tox(t), which is zero at t = kT, + tox(t). Now, 
for a given value of k, there is only one PPM pulse, and it occurs at t, = kT, + tox(t,). 
Thus g(t) = t, — KT, — tox(t,) = 0, as desired. Inserting A = t, g(t) = 1 — to x(t), 
etc., into Eq. (7) gives 


&( — 4) = |1 — tx) Slt Er, — to x(1)] 
and the PPM wave of Eq. (6) becomes 


x(t) = A[1 — to x(t)] X 9(t — to x(t) — KT;] 


The absolute value is dropped since |t)x(f)| < 1 for most signals of interest if tj << T,- 
We then convert the sum of impulses to a sum of exponentials via 


Y Ot kT) =f, v, eo [8) 


k--—oo n--oo 


which is Poisson's sum formula. Thus, we finally obtain 


x(t) = Af[1 — t x(t)] Y, etri) 


n-—-co 


= AfÍ1 — to EXE + M 2cos [nowt — not, x9] [9] 
n=] 
The derivation of Eq. (8) is considered in Prob. 6.3-6. 
Interpreting Eq. (9), we see that PPM with nonuniform sampling is a combina- 
tion of linear and exponential carrier modulation, for each harmonic of f, is phase- 
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modulated by the message x(t) and amplitude-modulated by the derivative x(t). The 
spectrum therefore consists of AM and PM sidebands centered at all multiples of f, 
plus a dc impulse and the spectrum of x(t). Needless to say, sketching such a spec- 
trum is a tedious exercise even for tone modulation. The leading term of Eq. (9) sug- 
gests that the message can be retrieved by lowpass filtering and integrating. How- 
ever, the integration method does not take full advantage of the noise-reduction 
properties of PPM, so the usual procedure is conversion to PAM or PDM followed 
by lowpass filtering. 


6.4 PROBLEMS 


Consider the chopper-sampled waveform in Eq. (3) with + = T,/2, f, = 100 Hz, and 
x(t) = 2 + 2 cos 27730t + cos 27r80t. Draw and label the one-sided line spectrum of 
x(t) for 0 = f = 300 Hz. Then find the output waveform when x,(f) is applied to an 
ideal LPF with B = 75 Hz. 


Do Prob. 6.1-1 with x(t) = 2 + 2 cos 27r30t + cos 277140t. 


The usable frequency range of a certain amplifier is f; to f; + B, with B => fe. Devise 
a system that employs bipolar choppers and allows the amplifier to handle signals 
having significant dc content and bandwidth W «& B. 


The baseband signal for FM stereo is 
xat) = [x,(t) + xa(t)]  [xi(t) — xp(t)] cos wt + A cos w,t/2 


with f; = 38 kHz. The chopper system in Fig. 6.14 is intended to generate this sig- 
nal. The LPF has gain K; for |f| = 15 KHZ, gain K, for 23 = |f| = 53 kHz, and 
rejects |f| = 99 kHz. Use a sketch to show that x,(t) = x,()s() + xp(D[1 — s(£)]. 
where s(t) is a unipolar switching function with 7 = T,/2. Then find the necessary 
values of K, and K;. 


xi) 
——— —0 


x(t) xy) 
x45 PRA uw C-9— +) 
——————90 
- pen 19 kHz 
[v] 
Figure P6.1-4 ia 


A popular stereo decoder circuit employs transistor switches to generate v;(f) = 
x(t) — Kx,(t) and u(t) = x4(t) — Kx,(t) where K is a constant, x,(t) = xj(t)s(t), 
x,(t) = x,(H[1 — s(t], x; (f) is the FM stereo baseband signal in Prob. 6.1—4, and s(t) 
is a unipolar switching function with 7 = 72. (a) Determine K such that lowpass 
filtering of v,(t) and u,(t) yields the desired left- and right-channel signals. (P) What's 
the disadvantage of a simpler switching circuit that has K = 0? 
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Derive Eq. (11) using Eq. (14), Sect. 2.5. 


Suppose x(t) has the spectrum in Fig. P6.1-7 with f, = 25 kHz and W = 10 kHz. 
Sketch xs(/) for f, = 60, 45, and 25 kHz. Comment in each case on the possible 
reconstruction of x(1) from x;(t). 


XCf) 


u u 


= fu -fat W 0 fa- W 
Figure Pó. 1-7 l 


Consider the bandpass signal spectrum in Fig. P6.1—7 whose Nyquist rate is f, = 2f,. 
However, the bandpass sampling theorem states that x(t) can be reconstructed from 
x(t) by bandpass filtering if f, = 2//m and the integer m satisfies (/W) — 1 < m S 
fal W. (a) Find m and plot f,/W versus f,/W for 0 < f,/W = 5. (b) Check the theorem 
by plotting X;( f) when f, = 2.5W and f, = 2.5W. Also show that the higher rate f, = 
4W would not be acceptable. 


The signal x(t) = sinc? 51 is ideally sampled at £ = 0, 0.1, +0.2, ..., and recon- 
structed by an ideal LPF with B — 5, unit gain, and zero time delay. Carry out the 
reconstruction process graphically, as in Fig. 6.1-6 for |¢| = 0.2. 


A rectangular pulse with 7 = 2 is ideally sampled and reconstructed using an ideal 
LPF with B = f/2. Sketch the resulting output waveforms when T, = 0.8 and 0.4, 
assuming one sample time is at the center of the pulse. 


Suppose an ideally sampled wave is reconstructed using a zero-order hold with time 
delay T = T, (a) Find and sketch y(Z) to show that the reconstructed waveform is a 
staircase approximation of x(#). (b) Sketch |Y(f)| for X(f) = II(f/2W) with W < f, 
Comment on the significance of your result. 


The reconstruction system in Fig. P6.1—12 is called a first-order hold. Each block 
labeled ZOH is a zero-order hold with time delay T = T, (a) Find A(t) and sketch 
y(f) to interpret the reconstruction operation. (b) Show that H(f) = T,(1 + j2afT,) 
(sinc? f T) exp (— j27fT,). Then sketch |Y(f)| for X(f) = II(f/2W) with W < f,/2. 





x s(t) y (t) 














Figure P6.1-12 
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Use Parseval's theorem and Eq. (14) with T, — 1/2W and B — W to show that the 
energy of a bandlimited signal is related to its sample values by 


E = (1/20) > paye 


The frequency-domain sampling theorem says that if x(r) is a timelimited signal, 
such that x(t) = 0 for |t| = T, then X(f) is completely determined by its sample val- 
ues X(nfy) with fy = 1/2T. Prove this theorem by writing the Fourier series for the 
periodic signal u(t) = x(t) * [È 6(t — kT )], where Ty = 2T, and using the fact that 
x(t) = v(DII(/2T). 

A signal with period T, = 0.08 us is to be displayed using a sampling oscilloscope 
whose internal high-frequency response cuts off at B = 6 MHz. Determine maxi- 
mum values for the sampling frequency and the bandwidth of the presampling LPF. 


Explain why the sampling oscilloscope in Prob. 6.1—15 will not provide a useful dis- 
play when T, < 1/3B. 


A W = 15 kHz signal has been sampled at 150 kHz. What will be the maximum per- 
cent aperature error if the signal is reconstructed using a (a) ZOH, (b) FOH? 


A W — 15 KHz signal is sampled at 150 kHz with a first-order Butterworth antialias- 
ing filter. What will be the maximum percent aliasing error in the passband? 


Show that the equality in Eq. (5) of Sect. 6.1 does not hold for a sinusoidal signal. 
What is the Nyquist rate to adequately sample the following signals: (a) sinc (1007), 
(b) sinc? (1002), (c) 10 cos?(2r 10%? 

Repeat Example 6.1—2 such that aliased components will be least 40 dB below the 
signal level at the half-power frequency of 159 KHz. 


Sketch |X,(f)| and find H,,(f) for flat-top sampling with 7 = T2, f, = 2.5W, and 
p(t) = I(t/r). Is equalization essential in this case? 


Do Prob. 6.2—1 for p(t) = (cos wt/T)II (t/t). 


Some sampling devices extract from x(t) its average value over the sampling dura- 
tion, so x(kT,) in Eq. (1) is replaced by 


x(kT,) & 1 | ^ dA) dÀ 
F KT,-7 

(a) Devise a frequency-domain model of this process using an averaging filter, with 

input x(t) and output x(t), followed by instantaneous flat-top sampling. Then obtain 

the inpulse response of the averaging filter and write the resulting expression for 

X,(f). (b) Find the equalizer needed when p(t) is a rectangular pulse. 


Consider the PAM signal in Eq. (4). (a) Show that its spectrum is 


XÁf) = APUD > [af — nf) + XQ = "1 
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(b) Sketch IX f)| when p(t) = I(r) with r = T/2 and ux(t) = cos 2mf,/t with 
Ín € SA2. 


Suppose the PAM signal in Eq. (4) is to be transmitted over a transformer-coupled 
channel, so the pulse shape is taken as p(t) = II[(t — 7/2)/7] — II[(t + 7/2)/7] to 
eliminate the dc component of x,(t). (a) Use the expression in Prob. 6.2—4a to sketch 
\X,(f)| when 7 = T44, X(f) = IICÉ/2W), and f, > 2W. (b) Find an appropriate equal- 
izer, assuming that x(t) has negligible frequency content for |f| < f, « W. Why is 
this assumption necessary? 


Show how a PAM signal can be demodulated using a product detector. Be sure to 
describe frequency parameters for the LO and the LPF. 


Calculate the transmission bandwidth needed for voice PDM with f, = 8 kHz, 
|ux(t)| = 0.8, and To = T,/5 when we want t, S 0.257 min- l 


A voice PDM signal with f, = 8 kHz and |ux(t)| = 0.8 is to be transmitted over a chan- 
nel having By = 500 kHz. Obtain bounds on r7, such that 7,4, = 7/3 and Tmin = 3t,- 


A pulse-modulated wave is generated by uniformly sampling the signal x(f) — 
cos 2mt/T,, att = kT,, where T, = T,/3. Sketch and label x,(¢) when the modulation 
is: (a) PDM with u = 0.8, To = 0.4T;, and leading edges fixed at t = kT,; (D PPM 
with t; = 0.57, and tj = 7 = 0.2T,. 

Do Prob. 6.3-3 with T, = T,,/6. 


Use Eq. (9) to devise a system that employs a PPM generator and produces narrow- 
band phase modulation with f, = mf,. 


Poisson's sum formula states in general that 


eo ] ceo 
b ea PnnAIL =L D 8(A = mL) 


n=-0o m--co 


where A is an independent variable and L is a constant. (a) Derive the time-domain 
version as given in Eq. (8) by taking #7![S,(f)]. (b) Derive the frequency-domain 
version by taking F[s;(ġ]. 


Let g(t) be any continuous function that monotonically increases or decreases over 
a € t X b and crosses zero at t = A within this range. Justify Eq. (7) by making the 
change-of-variable v = g(t) in 

b 

d[ g(t) ] dt 
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C systems that employ linear or exponential CW modulation may differ in many respects—type of 
modulation, carrier frequency, transmission medium, and so forth. But they have in common the property that a sinu- 
soidal bandpass signal with timevarying envelope and/or phase conveys the message information. Consequently, 
generic hardware items such as oscillators, mixers, and bandpass filters are important building blocks for all CW mod- 
ulation systems. Furthermore, many systems involve both linear and exponential CW modulation techniques. 

This chapter therefore takes a broader look at CW modulation systems and hardware, using concepts and 
results from Chapters 4 through 6. Specific topics include CW receivers, frequency and timedivision multiplexing, 
phase-lock loops, and television systems. 


OBJECTIVES 








After studying this chapter and working the exercises, you should be able to do each of the following: 


1. Design, in block-diagram form, a superheterodyne receiver that satisfies stated specifications (Sect. 7.1). 
2. Predict at what frequencies a superheterodyne is susceptible to spurious inputs (Sect. 7.1). 


3. Draw the block diagram of either an FDM or TDM system, given the specifications, and calculate the various 
bandwidths (Sect. 7.2). 


4. Identify the phase-locked loop structures used for pilot filtering, frequency synthesis, and FM detection (Sect. 7.3). 


5. Analyze a simple phase-lock-loop system and determine the condition for locked operation (Sect. 7.3). 

6. Explain the following TV terms: scanning raster, field, frame, retrace, luminance, chrominance, and color com- 
patibility (Sect. 7.4). 

7. Estimate the bandwidth requirement for image transmission given the vertical resolution, active line time, and 


aspect ratio (Sect. 7.4). 
8. Describe the significant performance differences of NTSC versus HDTV systems (Sect. 7.4). 


7.1 RECEIVERS FOR CW MODULATION 


All that is really essential in a CW receiver is some tuning mechanism, demodula- 
tion, and amplification. With a sufficiently strong received signal, you may even get 
by without amplification—witness the historic crystal radio set. However, most 
receivers operate on a more sophisticated superheterodyne principle, which we’ ll 
discuss first. Then we’ll consider other types of receivers and the related scanning 
spectrum analyzer. 


Superheterodyne Receivers 


Beside demodulation, a typical broadcast receiver must perform three other opera- 
tions: (1) carrier-frequency tuning to selectthe desired signal, (2) filtering to sepa- 
rate that signal from others received with it, and (3) amplification to compensate for 
transmission loss. And at least some of the amplification should be provided before 
demodulation to bring the signal up to a level useable by the demodulator circuitry. 
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7.1 Receivers for CW Modulation 


For example, if the demodulator is based on a diode envelope detector, the input sig- 
nal must overcome the diode's forward voltage drop. In theory, all of the foregoing 
requirements could be met with a high-gain tunable bandpass amplifier. In practice, 
fractional-bandpass and stability problems make such an amplifier expensive and 
difficult to build. Armstrong devised the superheterodyne or “superhet” receiver to 
circumvent these problems. 

The superhet principle calls for two distinct aaipliication and filtering sec- 
tions prior to demodulation, as diagrammed in Fig. 7.1-1. The incoming signal 
x,(f) is first selected and amplified by a radio-frequency (RF) section tuned to the 
desired carrier frequency f.. The amplifier has a relatively broad bandwidth B5, that 
partially passes adjacent-channel signals along with x,(f). Next a frequency con- 
verter comprised of a mixer and local oscillator translates the RF output down to 


an intermediate-frequency (IF) at fır < fa. The adjustable LO frequency tracks 
with the RF tuning such that 


fio = fe + fir or Áo F fe T Sir (1] 


and hence 


\fe — fiol = fir [2] 


An IF section with bandwidth B; ^ Br now removes the adjacent-channel signals. 
This section is a fixed bandpass amplifier, called the IF strip, which provides most 
of the gain. Finally, the IF output goes to the demodulator for message recovery and 
baseband amplification. The parameters for commercial broadcast AM and FM 
receivers are given in Table 7.1-1. 


Table 7.1-1 Parameters of AM and FM radios 


AM FM 
Carrier frequency 540-1600 kHz 88.1-107.9 MHz 
Carrier spacing 10 kHz 200 kHz 
Intermediate frequency 455 kHz 10.7 MHz 
IF bandwidth 6-10 kHz 200-250 kHz 
Audio bandwidth 3-5 kHz 15 kHz 


The spectral drawings of Fig. 7.1-2 help clarify the action of a superhet 
receiver. Here we assume a modulated signal with symmetric sidebands (as distin- 
guished from SSB or VSB), and we take fro = f. + fig so 


Se fro = fir 


The RF input spectrum in Fig. 7.1—2a includes our desired signal plus adjacent- 
channel signals on either side and another signal at the image frequency 


jo = fe + Yfir = fro + fir [3] 
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+ Bj2 while rejecting the image fre- 


The main task of the RF section is to pass f. 
quency signal. For f; to reach the mixer, it would be down-converted to 


fe — fro = Gio + fir) = fio = fir 
and the image frequency signal would produce an effect similar to cochannel inter- 
ference. Hence, we want an RF response |He-(f)| like the dashed line, with 
Br < Brr < 2fip [4] 


The filtered and down-converted spectrum at the IF input is shown in Fig 7.1—2b. 
The indicated IF response |H,-(f)| with Bj» ^ By completes the task of adjacent- 


channel rejection. 
The superheterodyne structure results in several practical benefits. First, tuning 
takes place entirely in the "front" end so the rest of the circuitry, including the 
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demodulator, requires no adjustment to change f.. Second, the separation between f, 
and fır eliminates potential instability due to stray feedback from the amplified out- 
put to the receiver's inputs. Third, most of the gain and selectivity is concentrated in 
the fixed-frequency IF strip. Since fj; is an internal design parameter, it can be cho- 
sen to obtain a reasonable fractional bandwidth B,,/f,- for ease of implementation. 
Taken together, these benefits make it possible to build superhets with extremely 
high gain—75 dB or more in the IP strip alone. We can also employ high-Q mechan- 
ical, ceramic, crystal, and SAW bandpass filters and thus achieve tremendous reduc- 
tions in adjacent channel interference. 

Additionally, when the receiver must cover a wide carrier-frequency range, the 
choice of fro = f. + fır may result in a smaller and more readily achieved LO tun- 
ing ratio. For example, with AM broadcast radios, where 540 « f. « 1600 kHz and 
fig = 455 kHz using fro = f. + fie results in 995 < fro < 2055 kHz and thus a LO 
tuning range of 2:1. On the other hand, if we chose fro = fe — fie, then for the same 
IF and input frequency range, we get 85 < fro < 1145 kHz or a LO tuning range of 
13:1. We should point out, however, that taking fro > f. in an SSB superhet causes 
sideband reversal in the down-converted signal, so USSB at RF becomes LSSB at 
IF, and vice versa. 

A major disadvantage of the superhet structure is its potential for spurious 
responses at frequencies beside f,. Image-frequency response is the most obvious 
problem. The radio of Figure 7.1-1 employs a tunable BPF for image rejection. 
Given today's integrated electronics technology, high-Q tunable BPFs may not be 
economical and thus some other means of image rejection must be employed. Rais- 
ing fyr will increase the spacing between f. and f; and thus reduce the requirements 
for the RF amplifier’s BPF. In fact, if we set fip high enough, we could use a more 
economical LPF for image rejection. 

But images are not the only problem superhets face with respect to spurious 
responses. Any distortion in the LO signal will generate harmonics that get mixed 
with a spurious input and be allowed to pass to the IF strip. That's why the LO must 
be a "clean" sine wave. The nonsinusoidal shape of digital signals 1s loaded with har- 
monics and thus if a receiver contains digital circuitry, special care must be taken to 
prevent these signals from "leaking" into the mixer stage. Further problems come 
from signal feedthrough and nonlinearities. For example, when a strong signal fre- 
quency near į f;p gets to the IF input, its second harmonic may be produced if the first 
stage of the IF amplifier is nonlinear. This second harmonic, approximately fi, will 
then be amplified by later stages and appear at the detector input as interference. 

Superheterodyne receivers often contain an automatic gain control (AGC) 
such that the receiver's gain is automatically adjusted according to the input signal 
level. AGC is accomplished by rectifying the receiver's audio signal, thus calculat- 
ing its average value. This dc value is then fed back to the IF or RF stage to increase 
or decrease the stage's gain. An AM radio usually includes an automatic volume 
control (AVC) signal from the demodulator back to the IF, while an FM receiver 
has automatic frequency control (AFC) fed back to the LO to correct small fre- 
quency drifts. 
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Superhets and spurious signal response 
A superhet receiver with fir = 500 kHz and 3.5 < fzo < 4.0 MHz has a tuning dial 
calibrated to receive signals from 3 to 3.5 MHz. It is set to receive a 3.0-MHz signal. 
The receiver has a broadband RF amplifier, and it has been found that the LO has a 
significant third harmonic output. If a signal is heard, what are all its possible carrier 
frequencies? With fo = 3.5 MHz, f. = fio — fie = 3.5 — 0.5 = 3.0 MEZ, and the 


- image frequency is f? = f. + 2f;- = 4.0 MHz. But the oscillator's third harmonic is 


10.5 MHz and thus f” = 3fro — fig = 10.5 — 0.5 = 10.0 MHz. The corresponding 
image frequency is then f," = f? + 2f = 10 + 1 = 11 MHz. Therefore, with this 
receiver, even though the dial states the station is transmitting at 3.0 MHz, it actually 
may also be 4, 10, or 11 MHz. 


Determine the spurious frequencies for the receiver of Example 7.1—1 if fip = 7.0 
MHz with 10 = fro = 10.5 MHz and the local oscillator outputs a third harmonic. 
What would the minimum spurious input rejection be in dB, if the receiver’s input 
was preceded by a first-order Butterworth LPF with B = 4 MHz. 





Direct Conversion Receivers 


Direct conversion receivers (DC) are a class of tuned-RF (TRF) receivers that 
consist of an RF amplifier followed by a product detector and suitable message 
amplification. They are called homodyne receivers. A DC receiver is diagrammed 
in Fig. 7.1-3. Adjacent channel interference rejection is accomplished by the LPF 
after the mixer. The DC receiver does not suffer from the same image problem that 
affects the superhet and because of improved circuit technology, particularly with 
higher gain and more stable RF amplifiers, it is capable of good performance. The 
DC’s simplicity lends itself to subminiature wireless sensor applications. 

The DC’s chief drawback is that it does not reject the image signal that is pres- 
ent in the opposite sideband and is thus more susceptible to noise and interference. 
Fig. 7.14 illustrates the output from two single-tone SSB signals, one transmitting 
at the upper sideband, or f, + fi, and an interfering signal at the lower sideband, orf, 
— f». Both f, and f; will appear at the receiver’s output. However, the system shown 
in Figure 7.1—4, which was originally developed by Campbell (1993), eliminates the 
other sideband. If the nodes in Figure 7.1—4 are studied, the receiver's output only 
contains the upper sideband f. + f, signal. 


Special-Purpose Receivers 


Other types of receivers used for special purposes include the heterodyne, the TRF, 
and the double-conversion structure. A heterodyne receiver is a superhet without 
the RF section, which raises potential image-frequency problems. Such receivers 
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Figure 7.1-3 Direct conversion receiver. 
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Figure 7.1-4 Direct conversion receiver with opposite sideband rejection. 


can be built at microwave frequencies with a diode mixer preceded by a fixed 
microwave filter to reject images. In addition to the DC TRF receiver, we can also 
have a TRF using a tunable RF amplifier and envelope detector. The classic crystal 
radio is the simplest TRF. 

A double-conversion receiver in Fig. 7.1—5 takes the superhet principle one 
step further by including two frequency converters and two IF sections. The second 
IF is always fixed-tuned, while the first IF and second LO may be fixed or tunable. 
In either case, double conversion permits a larger value of fır., to improve image 
rejection in the RF section, and a smaller value of fir. to improve adjacent-channel 
rejection in the second IF. High-performance receivers for SSB and shortwave AM 
usually improve this design strategy. 


264 CHAPTER7 € Analog Communication Systems 


Demod 





Figure 7.1-5 Double-conversion receiver. 


Notice that a double-conversion SSB receiver with synchronous detection 
requires three stable oscillators plus automatic frequency control and synchroniza- 
tion circuitry. Fortunately IC technology has made the frequency synthesizer avail- 


able for this application. We'll discuss frequency synthesis using phase-lock loops 
in Sect. 7.3. 


Receiver Specifications 


We now want to consider several parameters that determine the ability of a receiver 
to successfully demodulate a radio signal. Receiver sensitivity is the minimum 
input voltage that produces a specified signal-to-noise ratio at the output of the IF 
section. A good quality short-wave radio typically has a sensitivity of 14V for a 
40 dB signal-to-noise ratio. Dynamic range is the difference between the largest 
input signal that will not become distorted and the smallest signal that can be dis- 
cerned, and is an important parameter. It is measured in dB. The maximum dynamic 
range of most analog amplifiers is 100 dB. Let's say we are listening to a weak AM 
broadcast signal and there is a strong station transmitting at a significantly different 
frequency, but within the RF amplifier's passband. The strong signal can overload i 
the RF amplifier and thus wipe out the weak signal. Selectivity specifies a receiver's | 
ability to discriminate against adjacent channel signals. It is a function of the IF ! 
strip’s BPF. The noise figure indicates how much the receiver degrades the input i 
signal's signal-to-noise ratio and is calculated by j 


(S/N) input 
(S/ N ) output 


Typical noise figures are 5-10 dB. Finally image rejection is defined as 


Nf = 


RR = 10 log|Har (f-)/Harr (F)? dB [6] 


A typical value of image rejection is 50 dB. This equation may apply to other types 
of spurious inputs as well. 


7. Receivers for CW Modulation 





Suppose a superhet’s RF section is a typical tuned circuit described by Eq. (17), 
Sect. 4.1, with f, = f. and Q = 50. Show that achieving RR = 60 dB requires f/f; ~= 
20 when f? = f. + 2fje. This requirement could easily be satisfied by a double con- 
version receiver with fip_, ^ 9.5f.. 





Scanning Spectrum Analyzers* 


If the LO in a superhet is replaced by a VCO, then the predetection portion acts like 
a voltage-tunable bandpass amplifier with center frequency f; = fio + fi, and band- 
width B = Bj. This property is at the heart of the scanning spectrum analyzer in 
Fig. 7.1-6a—a useful laboratory instrument that displays the spectral magnitude of 
an input signal over some selected frequency range. 

The VCO is driven by a periodic ramp generator that sweeps the instantaneous 
frequency folt) linearly from f, to f; in T seconds. The IF section has a narrow band- 
width B, usually adjustable, and the IF output goes to an envelope detector. Hence, 
the system's amplitude response at any instant t looks like Fig. 7.1-6b, where f(t) = 
Fro) — fip. A fixed BPF (or LPF) at the input passes f, = f = f; while rejecting the 
image at fo(t) + 2f;r. 

As f(t) repeatedly scans past the frequency components of an input signal v(t), 
its spectrum is displayed by connecting the envelope detector and ramp generator to 
the vertical and horizontal deflections of an oscilloscope. Obviously, a transient sig- 
nal would not yield a fixed display, so v(t) must be either a periodic or quasi-periodic 
signal or a stationary random signal over the time of observation. Correspondingly, 


vO. BPF (x) = 
fro | 
VCO 


IH Cf) IHgpr(f )l 


A———--4AX----------« 






Ramp 
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fi fo = h fo Ð + fir 


fro - fie 
(b) 
Figure 7.1-6 Scanning spectrum analyzer, (a] Block diagram; (b) amplitude response. 
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the display represents the amplitude line spectrum or the power spectral density. (A 
square-law envelope detector would be used for the latter.) 

Some of the operational subtleties of this system are best understood by assum- 
ing that u(t) consists of two or more sinusoids. To resolve one spectral line from the 
others, the IF bandwidth must be smaller than the line spacing. Hence, we call B the 
frequency resolution, and the maximum number of resolvable lines equals (f, — 


f,)/B. The IF output produced by a single line takes the form of a bandpass pulse 
with time duration 


T = BT/(f, — fi) = B/fo 


where fy = (f; — f;/T represents the frequency sweep rate in hertz per second. 
But a rapid sweep rate may exceed the IF pulse response. Recall that our guide- 
line for bandpass pulses requires B = 1/7 = f,/B, or 
; fr MA fi 2 
= MR [7] 
This important relation shows that accurate resolution (small B) calls for a slow rate 
and correspondingly long observation time. Also note that Eq. (7) involves four 
parameters adjustable by the user. Some scanning spectrum analyzers have built-in 
hardware that prevents you from violating Eq. (7); others simply have a warning light. 





7.2  MULTIPLEXING SYSTEMS 


When several communication channels are needed between the same two points, 
significant economies may be realized by sending all the messages on one transmis- 
sion facility—a process called multiplexing. Applications of multiplexing range 
from the vital, if prosaic, telephone network, to the glamour of FM stereo and space- 
probe telemetry systems. There are three basic multiplexing techniques: frequency- 
division multiplexing (FDM), time-division multiplexing (TDM), and code-division 
multiplexing, treated in Chapter 15. The goal of these techniques is to enable multi- 
ple users to share a channel, and hence they are referred to as frequency-division 
multiple access (FDMA), time-division multiple access (TDMA), and code-division 
multiple access (CDMA). 


Frequency-Division Multiplexing 


The principle of FDM is illustrated by Fig. 7.2-1a, where several input messages 
(three are shown) individually modulate the subcarriers fs, fen, and so forth, after 
passing through LPFs to limit the message bandwidths. We show the subcarrier 
modulation as SSB as it often is, but any of the CW modulation techniques could be 
employed, or a mixture of them. The modulated signals are then summed to produce 
the baseband signal, with spectrum X,(f) as shown in Fig. 7.2-1b. (The designation 
"baseband" indicates that final carrier modulation has not yet taken place.) The 
baseband time function x,(t) is left to your imagination. 
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Figure 7.2-1 Typical FDM transmitter. (a) Input spectra and block diagram; (b) baseband 
FDM spectrum. 


Assuming that the subcarrier frequencies are properly chosen, the multiplexing 
operation has assigned a slot in the frequency domain for each of the individual 
messages in modulated form, hence the name frequency-division multiplexing. 
The baseband signal may then be transmitted directly or used to modulate a trans- 
mitted carrier of frequency f.. We are not particularly concerned here with the nature 
of the final carrier modulation, since the baseband spectrum tells the story. 

Message recovery or demodulation of FDM is accomplished in three steps por- 
trayed by Fig. 7.2-2. First, the carrier demodulator reproduces the baseband signal 
x(t). Then the modulated subcarriers are separated by a bank of bandpass filters in 
parallel, following which the messages are individually detected. 

The major practical problem of FDM is cross talk, the unwanted coupling of 
one message into another. Intelligible cross talk (cross-modulation) arises primarily 
because of nonlinearities in the system which cause one message signal to appear as 
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Figure 7.2-2 Typical FDM receiver. 


modulation on another subcarrier. Consequently, standard practice calls for negative 
feedback to minimize amplifier nonlinearity in FDM systems. (As a matter of his- 
torical fact, the FDM cross talk problem was a primary motivator for the develop- 
ment of negative-feedback amplifiers.) 

Unintelligible cross talk may come from nonlinear effects or from imperfect 
spectral separation by the filter bank. To reduce the latter, the modulated message 
spectra are spaced out in frequency by guard bands into which the filter transition 
regions can be fitted. For example, the guard band marked in Fig. 7.2-1b is of width 
fa — (fa + Wi). The net baseband bandwidth is therefore the sum of the modulated 
message bandwidths plus the guard bands. But the scheme in Fig. 7.2-2 is not the 
only example of FDM. The commercial AM or FM broadcast bands are everyday 
examples of FDMA, where several broadcasters can transmit simultaneously in the 
same band, but at slightly different frequencies. 


FDMA Satellite Systems 
The Intelsat global network adds a third dimension to long-distance communication. 
Since a particular satellite links several ground stations in different countries, various 
access methods have been devised for international telephony. One scheme, known 
as frequency-division multiple access (FDMA), assigns a fixed number of voice 
channels between pairs of ground stations. These channels are grouped with standard 
FDM hardware, and relayed through the satellite using FM carrier modulation. 

For the sake of example, suppose a satellite over the Atlantic Ocean serves 
ground stations in the United States, Brazil, and France. Further suppose that 36 
channels (three groups) are assigned to the US—France route and 24 channels (two 
groups) to the US-Brazil route. Figure 7.2-3 shows the arrangement of the US 
transmitter and the receivers in Brazil and France. Not shown are the French and 
Brazilian transmitters and the US receiver needed for two-way conversations. Addi- 
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Figure 7.2-3 Simplified FDMA satellite system. 


tional transmitters and receivers at slightly different carrier frequencies would pro- 
vide a Brazil-France route. 

The FDMA scheme creates at the satellite a composite FDM signal assembled 
with the FM signals from all ground stations. The satellite equipment consists of a 
bank of transponders. Each transponder has 36-MHz bandwidth accommodating 
336 to 900 voice channels, depending on the ground-pair assignments. More details 
and other access schemes can be found in the literature. | 





Suppose an FDM baseband amplifier has cube-law nonlinearity which produces a 
baseband component proportional to (v; cos wt)’v, cos «t, where f, and f; are two 
subcarrier frequencies. Show that AM subcarrier modulation with v, = 1 + x,(4) 
and v, = 1 + x;(t) results in both intelligible and unintelligible cross talk on subcar- 
rier fj. Compare with the DSB case v, = x,(4) and v, = x;(t). 





Figure 7.2-4a diagrams the FDM system that generates the baseband signal for FM 
stereophonic broadcasting. The left-speaker and right-speaker signals are first 
matrixed and preemphasized to produce x; (f) + xp(t) and x; (t) — xg(t). The sum sig- 
nal is heard with a monophonic receiver; matrixing is required so the monaural lis- 
tener will not be subjected to sound gaps in program material having stereophonic 
Ping-Pong effects. The x,(t) + x&(f) signal is then inserted directly into the base- 
band, while x;(f) — x,(t) DSB modulates a 38-kHz subcarrier. Double-sideband 
modulation is employed to preserve fidelity at low frequencies, and a 19-kHz pilot 
tone is added for receiver synchronization. 
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Figure 7.2-4 FM stereo multiplexing. (c) Transmitter; (b) baseband spectrum. 


The resulüng baseband spectrum is sketched in Fig. 7.2-4b. Also shown is 
another spectral component labeled SCA, which stands for Subsidiary Communica- 
tion Authorization. The SCA signal has NBFM subcarrier modulation and is transmit- 
ted by some FM stations for the use of private subscribers who pay for commercial- 
free program material—the so-called “wallpaper music” heard in stores and offices. 

For stereo broadcasting without SCA, the pilot carrier is allocated 10 percent of 
the peak frequency deviation and the seesaw relationship between L + R and L — R 
permits each to achieve nearly 90 percent deviation. The fact that the baseband spec- 
trum extends to 53 KHz (or 75 kHz with SCA) does not appreciably increase the 
transmission bandwidth requirement because the higher frequencies produce smaller 
deviation ratios. High-fidelity stereo receivers typically have By = 250 kHz. 

The stereo demultiplexing or decoding system is diagrammed in Fig. 7.2—5. 
Notice how the pilot tone is used to actuate the stereo indicator as well as for syn- 
chronous detection. Integrated-circuit decoders employ switching circuits or phase- 
lock loops to carry out the functional operations. 

Incidentally, discrete four-channel (quadraphonic) disk recording takes a logi- 
cal extension of the FM stereo strategy to multiplex four independent signals on the 
two channels of a stereophonic record. Let's denote the four signals as Lp, Lg, Rp, 
and Rg (for left-front, left-rear, etc.). The matrixed signal Lr + Lp is recorded 
directly on one channel along with Lr — Lg multiplexed via frequency modulation 
of a 30-kHz subcarrier. The matrixed signals Rp + Rpand Rp — Rp are likewise mul- 
tiplexed on the other channel. Because the resulting baseband spectrum goes up to 
45 kHz, discrete quadraphonic signals cannot be transmitted in full on stereo FM. 
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Figure 7.2-5 FM stereo multiplex receiver. 


Other quadraphonic systems have only two independent channels and are compati- 
ble with FM stereo. 


Quadrature-Carrier Multiplexing 


Quadrature-carrier multiplexing, also known as quadrature amplitude modulation 
(QAM), utilizes carrier phase shifting and synchronous detection to permit two DSB 
signals to occupy the same frequency band. Figure 7.2-6 illustrates the multiplexing 
and demultiplexing arrangement. The transmitted signal has the form 


x(t) = A,[x,(t) cos wt + x,(t) sin o t] n 


Since the modulated spectra overlap each other, this technique is more properly 
characterized as frequency-domain rather than frequency-division multiplexing. 
From our prior study of synchronous detection for DSB and SSB, you should 
readily appreciate the fact that QAM involves more stringent synchronization than, 
say, an FDM system with SSB subcarrier modulation. Hence, QAM is limited to 
specialized applications, notably color television and digital data transmission. 
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Figure 7.2-6 Quadrature-carrier multiplexing. 
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Time-Division Multiplexing 


A sampled waveform is “off’ most of the time, leaving the time between samples 
available for other purposes. In particular, sample values from several different sig- 
nals can be interlaced into a single waveform. This is the principle of time-division 
multiplexing (TDM) discussed here. 

The simplified system in Fig. 7.2—7 demonstrates the essential features of time- 
division multiplexing. Several input signals are prefiltered by the bank of input 
LPFs and sampled sequentially. The rotating sampling switch or commutator at the 
transmitter extracts one sample from each input per revolution. Hence, its output is 
a PAM waveform that contains the individual samples periodically interlaced in 
time. A similar rotary switch at the receiver, called a decommutator or distributor, 
separates the samples and distributes them to another bank of LPFs for reconstruc- 
tion of the individual messages. 

If all inputs have the same message bandwidth W, the commutator should rotate 
at the rate f, = 2W so that successive samples from any one input are spaced by T, = 
1/f, = 1/2W. The time interval T, containing one sample from each input is called a 
frame. If there are M input channels, the pulse-to-pulse spacing within a frame is 
T,/M = 1/Mf,. Thus, the total number of pulses per second will be 


r = Mf, = 2MW [2] 


which represents the pulse rate or signaling rate of the TDM signal. 
Our primitive system shows mechanical switching to generate multiplexed 
PAM. But almost all practical TDM systems employ electronic switching. Further- 
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Figure 7.2-7 TDM system. (a) Block diagram; {b} waveforms. 


ZEE i 


Aoi, hls a S LC eo aaa t Tuta CENTRO us TES AI rt cca rad a CR Lt at c ON oed ata caida 


svn iia ie a RIP ONE NP 





7.2 Multiplexing Systems 


more, other types of pulse modulation can be used instead of PAM. Therefore, a more 
generalized commutator might have the structure diagrammed in Fig. 7.2-8, where 
pulse-modulation gates process the individual inputs to form the TDM output. The 
gate control signals come from a flip-flop chain (a broken-ring counter) driven by a 
digital clock at frequency Mf,. The decommutator would have a similar structure. 
Regardless of the type of pulse modulation, TDM systems require careful syn- 
chronization between commutator and decommutator. Synchronization is a critical 
consideration in TDM, because each pulse must be distributed to the correct output 
line at the appropriate time. A popular brute-force synchronization technique 
devotes one time slot per frame to a distinctive marker pulse or nonpulse, as illus- 
trated in Fig. 7.2-9. These markers establish the frame frequency f, at the receiver, 
but the number of signal channels is reduced to M — 1. Other synchronization meth- 
ods involve auxiliary pilot tones or the statistical properties of the TDM signal itself. 
Radio-frequency transmission of TDM necessitates the additional step of CW 
modulation to obtain a bandpass waveform. For instance, a TDM signal composed 
of duration or position-modulated pulses could be applied to an AM transmitter with 
100 percent modulation, thereby producing a train of constant-amplitude RF pulses. 





(b) 
Figure 7.2-8 (a) Electronic commutator for TDM; (b) timing diagram. 
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The compound process would be designated PDM/AM or PPM/AM, and the 
required transmission bandwidth would be twice that of the baseband TDM signal. 
The relative simplicity of this technique suits low-speed multichannel applications 
such as radio control for model airplanes. 

More sophisticated TDM systems may use PAM/SSB for bandwidth conserva- 
tion or PAM/FM for wideband noise reduction. The complete transmitter diagram in 
Fig. 7.2-10a now includes a lowpass baseband filter with bandwidth 


B, = år = iMf, [3] 


Baseband filtering prior to CW modulation produces a smooth modulating wave- 
form x,(f) having the property that it passes through the individual sample values at 
the corresponding sample times, as portrayed in Fig. 7.2-10b. Since the interlaced 
sample spacing equals 1/Mf,, the baseband filter constructs x,(¢) in the same way 
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Figure 7.2-10 {a} TDM transmitter with baseband filtering; (b) baseband waveform. 
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that an LPF with B = f/2 would reconstruct a waveform x(t) from its periodic sam- 
ples x(kT,) with T, = L/f 

If baseband filtering is employed, and if the sampling frequency is close to the 
Nyquist rate f, _ = 2W for the individual inputs, then the transmission bandwidth 
for PAM/SSB becomes l 


Br =4M X 2W = MW 


Under these conditions, TDM approaches the theoretical minimum bandwidth of 
frequency-division multiplexing with SSB subcarrier modulation. 

Although we've assumed so far that all input signals have the same bandwidth, 
this restriction is not essential and, moreover, would be unrealistic for the important 
case of analog data telemetry. The purpose of a telemetry system is to combine and 
transmit physical measurement data from different sources at some remote location. 
The sampling frequency required for a particular measurement depends on the phys- 
ical process involved and can range from a fraction of one hertz up to several kilo- 
hertz. A typical telemetry system has a main multiplexer plus submultiplexers 
arranged to handle 100 or more data channels with various sampling rates. 





TDM Telemetry 
For the sake of illustration, suppose we need 5 data channels with minimum sampling 
rates of 3000, 700, 600, 300, and 200 Hz. If we used a 5-channel multiplexer with f, = 
3000 Hz for all channels, the TDM signaling rate would be r = 5 X 3000 = 15 kHz— 
not including synchronization markers. A more efficient scheme involves an 8- 
channel main multiplexer with f, = 750 Hz and a 2-channel submultiplexer with f, = 
375 Hz connected as shown in Fig. 7.2-11. 

The two lowest-rate signals x4(£) and x(t) are combined by the submultiplexer 
to create a pulse rate of 2 X 375 = 750 Hz for insertion into one channel of the 


Sampling rate, Hz 














Signal Minimum Actual 
x(D 3000 4x750 
xy) 700 750 TDM 
output 
xy() 600 750 
x(D 300 1/2x750 
xs(t) 200 1/2x750 
Clock 
E: 
Figure 7.2-11 TDM telemetry system with main multiplexer and submultiplexer. 
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main multiplexer. Hence, the samples of x,(f) and x5(;7) will appear in alternate 
frames. On the other hand, the highest-rate signal x,(f) is applied to four inputs on 
the main multiplexer. Consequently, its samples appear in four equispaced slots 
within each frame, for an equivalent sampling rate of 4 X 750 — 3000 Hz. The total 
output signaling rate, including a marker, is r = 8 X 750 Hz = 6 kHz. Baseband 
filtering would yield a smoothed signal whose bandwidth B, — 3 kHz fits nicely 
into a voice telephone channel! 
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Suppose the output in Fig. 7.2—11 is an unfiltered PAM signal with 50 percent duty 
cycle. Sketch the waveform for two successive frames, labeling each pulse with its 
source signal. Then calculate the required transmission bandwidth B; from Eq. (6), 
Sect. 6.2. 
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Cross Talk and Guard Times 


When a TDM system includes baseband filtering, the filter design must be done 
with extreme care to avoid interchannel cross talk from one sample value to the next 
in the frame. Digital signals suffer a similar problem called intersymbol interfer- 
ence, and we defer the treatment of baseband waveform shaping to Sect. 11.3. 

A TDM signal without baseband filtering also has cross talk if the transmission 
channel results in pulses whose tails or postcursors overlap into the next time slot of 
the frame. Pulse overlap is controlled by establishing guard times between pulses, 
analogous to the guard bands between channels in an FDM system. Practical TDM 
systems have both guard times and guard bands, the former to suppress cross talk, 
the latter to facilitate message reconstruction with nonideal filters. 

For a quantitative estimate of cross talk, let's assume tbat the transmission 
channel acts like a first-order lowpass filter with 3-dB bandwidth B. The response to 
a rectangular pulse then decays exponentially, as sketched in Fig. 7.2-12. The guard 
time T, represents the minimum pulse spacing, so the pulse tail decays to a value no 
larger than A, = Ae~?™878 by the time the next pulse arrives. Accordingly, we define 
the cross-talk reduction factor. 


ka 2 10log(A,/AY ~ —54.5 BT, dB [4] 


Figure 7.2-12 Cross talk in TDM. 
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Figure 7.2-13 TDM/PPM with guard time. 


Keeping the cross talk below —30 dB calls for T, > 1/2B. 

Guard times are especially important in TDM with pulse-duration or pulse- 
position modulation because the pulse edges move around within their frame slots. 
Consider the PPM case in Fig. 7.2-13: here, one pulse has been position-modulated 
forward by an amount żọ and the next pulse backward by the same amount. The 
allowance for guard time T, requires that T, + 2t; + 2(7/2) = TJM or 


ver 
fj hs M "^ ^h [5] 


A similar modulation limit applies in the case of PDM. 





Nine voice signals plus a marker are to be transmitted via PPM on a channel having EXERCISE 7.2-3 
B = 400 kHz. Calculate T, such that E =~ —60 dB. Then find the maximum permit- 
ted value of ty if f, = 8 kHz and T = 4(T. did 





Comparison of TDM and FDM 


Time-division and frequency-division multiplexing accomplish the same end by dif- 
ferent means. Indeed, they may be classified as dual techniques. Individual TDM 
channels are assigned to distinct time slots but jumbled together in the frequency 
domain; conversely, individual FDM channels are assigned to distinct frequency 
slots but jumbled together in the time domain. What advantages then does each offer 
over the other? 

Many of the TDM advantages are technology driven. TDM is readily imple- 
mented with high-density VLSI circuitry where digital switches are extremely eco- 
nomical. Recall that FDM requires an analog subcarrier modulator, bandpass filter, 
and demodulator for every message channel. These are relatively expensive to 
implement in VLSI. But all of these are replaced by a TDM commutator and decom- 
mutator switching circuits, easily put on a chip. However, TDM synchronization is 
only slightly more demanding than that of suppressed-carrier FDM. 

Second, TDM is invulnerable to the usual causes of cross talk in FDM, namely, 
imperfect bandpass filtering and nonlinear cross-modulation. However, TDM cross- 


talk immunity does depend on the transmission bandwidth and the absence of delay 
distortion. 
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Third, the use of submultiplexers allows a TDM system to accommodate differ- 
ent signals whose bandwidths or pulse rates may differ by more than an order of 
magnitude. This flexibility has particular value for multiplexing digital signals, as 
we'll see in Sect. 12.5. 

Finally, TDM may or may not be advantageous when the transmission medium 
is subject to fading. Rapid wideband fading might strike only occasional pulses 
in a given TDM channel, whereas all FDM channels would be affected. But siow 
narrowband fading wipes out all TDM channels, whereas it might hurt only one 
FDM channel. 

Many systems such as satellite relay are a hybrid of FDMA and TDMA. 
For example, we have FDMA where specific frequency channels will be allocated 
to various services. In turn then, each channel may be shared by individual users 
using TDMA. 





7.3  PHASE-LOCK LOOPS 


The phase-lock loop (PLL) is undoubtedly the most versatile building block avail- 
able for CW modulation systems. PLLs are found in modulators, demodulators, fre- 
quency synthesizers, multiplexers, and a variety of signal processors. We'll illustrate 
some of these applications after discussing PLL operation and lock-in conditions. 
Our introductory study provides a useful working knowledge of PLLs but does not 
go into detailed analysis of nonlinear behavior and transients. Treatments of these 
advanced topics are given in Blanchard (1976), Gardner (1979), Meyr and Ascheid 
(1990), and Lindsey (1972). 


PLL Operation and Lock-In 


The basic aim of a PLL is to lock or synchronize the instantaneous angle (i.e., 
phase and frequency) of a VCO output to the instantaneous angle of an external 
bandpass signal that may have some type of CW modulation. For this purpose, the 
PLL must perform phase comparison. We therefore begin with a brief look at 
phase comparators. 

The system in Fig. 7.3-1a is an analog phase comparator. It produces an out- 
put y(t) that depends on the instantaneous angular difference between two bandpass 
input signals, x(t) = A, cos @,(f) and u(t) = A, cos 0,(t). Specifically, if 


O(t) = 6,(t) — e(t) + 90° uU 


and if the LPF simply extracts the difference-frequency term from the product 
x(t)u(t), then 


y(t) = $ A, A, cos [6,(t) — 6,(0)] 
= } A, A, cos [e(t) — 90°] = 1 AL A, sin e(t) 


7.3  Pbhase-Lock Loops 


x(t) = A, cos 6, (t) y(t) «^ AA, sine (2) l 
o LPF : h Ac Ay 
| —180° uu 
v(t) = A, cos 0, (2) 


6, (t) = 6,0) ~ elt) + 90° 





fa) 


xét) f Switching x) 


Lim v(t) 





(b) 
Figure 7.3-1 Phase comparators. (a) Analog; (b) digital. 


We interpret e(f) as the angular error, and the plot of y versus e emphasizes that 
y(t) = 0 when e(t) = 0. Had we omitted the 90° shift in Eq. (1), we would get y(?) = 0 
at e(t) = +90°. Thus, zero output from the phase comparator corresponds to a quad- 
rature phase relationship. 

Also note that y(t) depends on A, A, when e(t) 0, which could cause problems 
if the input signals have amplitude modulation. These problems are eliminated by 
the digital phase comparator in Fig. 7.3-1b, where hard limiters convert input sinu- 
soids to square waves applied to a switching circuit. The resulting plot of y versus € 
has a triangular or sawtooth shape, depending on the switching circuit details. How- 
ever, all three phase-comparison curves are essentially the same when |«(2)| << 
90?—the intended operating condition in a PLL. 

Hereafter, we'll work with the analog PLL structure in Fig. 7.3-2. We assume 
for convenience that the external input signal has constant amplitude A, — 2 so that 
x,(t) = 2 cos 6,(t) where, as usual, 


O(t) = et + P(t) o, = 2nf. (2] 


| 


v(t) = cos 8,(r) 


x(t) = 2 cos 8 (t) y(t) = Kasin e (1) 





Figure 7.3-2 Phase-lock loop. 
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We also assume a unit-amplitude VCO output v(t) = cos 6,(¢) and a loop amplifier 
with gain K,. Hence, 

y(t) = K, sin e(t) [3] 
which is fed back for the control voltage to the VCO. 

Since the VCO’s free-running frequency with y(t) = 0 may not necessarily 
equal f, we'll write it as f, = f. — Af where Af stands for the frequency error. Appli- 
cation of the control voltage produces the instantaneous angle 

8,(t) = 2n(f, — Af)t + $,(t) + 90° [4a] 
with 


t 


P(t) = 2v K, | yA) dà [4b] 


when K, equals the frequency-deviation constant. The angular error is then 
e(t) = 8(t) — 6,(t) + 90? 
= 2m Aft + dt) — pt) 
and differentiation with respect to t gives 
e(t) = 2m Af + ó(r) — 27K, y(t) 


Upon combining this expression with Eq. (3) we obtain the nonlinear differential 
equation. 


e(t) + 27K sin e(t) = 2m Af + A(t) [5] 


in which we've introduced the loop gain 
K2 KK, 


This gain is measured in hertz and turns out to be a critical parameter. 

Equation (5) governs the dynamic operation of a PLL, but it does not yield a 
closed-form solution with an arbitrary $(f). To get a sense of PLL behavior and 
lock-in conditions, consider the case of a constant input phase $(7) = ġo starting at 
t = 0. Then $(t) = 0 and we rewrite Eq. (5) as 

1 Af 
L—— (t) + sine(t) = — t20 6] 
Sup eS 
Lock-in with a constant phase implies that the loop attains a steady state with €(t) = 0 
and e(f) = €, Hence, sin €, = Af/K at lock-in, and it follows that 


a= [7a] 
= sin — a 
Ess arcsin K 
A 
Yss 7 Ka sin Ess = E) [7b] 


K, 





7.3  Phase-Lock Loops 


U,(t) = cos (o, t + do — e,, + 90°) [7c] 


Note that the nonzero value of y,, cancels out the VCO frequency error, and v,,(f) is 
locked to the frequency of the input signal x,(t). The phase error e, will be negligi- 
ble if |Af/K| «& 1. 

However, Eq. (6) has no steady-state solution and e,, in Eq. (7a) is undefined 
when |Af/K| > 1. Therefore, lock-in requires the condition 


K = |Af| [8] 


Stated another way, a PLL willlock to any constant input frequency within the range 
+ K hertz of the VCO’s free-running frequency f}. 

Additional information regarding PLL behavior comes from Eq. (6) when we 
require sufficient loop gain that e,, ~ 0. Then, after some instant tj > 0, e(t) will be 
small enough to justify the approximation sin e(f) = e(t) and 

1 


one) *te(t) 20 t2t [9a] 


This linear equation yields the well-known solution 
e(t) = e(t)e 779 tzt [9b] 


a transient error that virtually disappears after five time constants have elapsed, that is, 
e(t) = 0 for t > tg + 5/(27K). We thus infer that if the input x (f) has a time-varying 
phase (t) whose variations are slow compared to 1/(27rK), and if the instantaneous 
frequency f. + à(t)/2« does not exceed the range of f, + K, then the PLL will stay in 
lock and track $(f) with negligible error—provided that the LPF in the phase com- 
parator passes the variations of f(t) on to the VCO. 





€ = 2m(Af — Ksine) 


(a) Sketch € versus e for K = 2 Af and show that an arbitrary initial value e(0) must 
go to &, = 30? + m 360? where m is an integer. Hint: e(t) increases when é(t) > 0 
and decreases when &(f) < 0. (b).Now sketch the phase-plane plot for K < Af to 
show that |e(r)| > 0 for any e(t) and, consequently, e,, does not exist. 


Synchronous Detection and Frequency Synthesizers 


The lock-in ability of a PLL makes 1t ideally suited to systems that have a pilot car- 
rier for synchronous detection. Rather than attempting to filter the pilot out of the 
accompanying modulated waveform, the augmented PLL circuit in Fig. 7.3-3 can 
be used to generate a sinusoid synchronized with the pilot. To minimize clutter here, 
we've lumped the phase comparator, lowpass filter, and amplifier into a phase dis- 
criminator (PD) and we've assumed unity sinusoidal amplitudes throughout. 


The phase-plane plot of € versus eis defined by rewriting Eq. (6) in the form 
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cos (w,t + dg) 





plus 
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waveform inse det 






Lock-in 
indicator 


Figure 7.3-3 PLL pilot filter with two phase discriminators (PD). 


Initial adjustment of the tuning voltage brings the VCO frequency close to f, 
and e,, = 0, a condition sensed by the quadrature phase discriminator and displayed 
by the lock-in indicator. Thereafter, the PLL automatically tracks any phase or fre- 
quency drift in the pilot, and the phase-shifted VCO output provides the LO signal 
needed for the synchronous detector. Thus, the whole unit acts as a narrowband 
pilot filter with a virtually noiseless output. 

Incidentally, a setup like Fig. 7.3~3 can be used to search for a signal at some 
unknown frequency. You disconnect the VCO control voltage and apply a ramp gen- 
erator to sweep the VCO frequency until the lock-in indicator shows that a signal has 
been found. Some radio scanners employ an automated version of this procedure. 

.For synchronous detection of DSB without a transmitted pilot, Costas invented 
the PLL system in Fig. 7.3—4. The modulated DSB waveform x(t) cos w,t with band- 
width 2W is applied to a pair of phase discriminators whose outputs are proportional 


to x(t) sin €, and x(f) cos Es Multiplication and integration over T >> 1/W produces 
the VCO control voltage l 


l ; E 
ys ^ T(x?(t)) sin ej, cos €,, = Es S, sin 2e,, 


x(t) sin €, 










t 
cos (wt — €,, + 90°) ee —(x) Multiplier 


Ys i S Sin 2€,, 


x(t) cos ot 








x(t) cos Ess Output 


Figure 7.3-4 Costas PLL system for synchronous detection. 


: 





7.3 Phase-Lock Loops 


If Af ~ 0, the PLL locks with €, ^ O and the output of the quadrature discriminator 
is proportional to the demodulated message x(z). Of course the loop loses lock if 
x(t) = 0 for an extended interval. 

The frequency-offset loop in Fig. 7.3—5 translates the input frequency (and 
phase) by an amount equal to that of an auxiliary oscillator. The intended output fre- 
quency is now f. + fı, so the free-running frequency of the VCO must be 


f= Ger hi) BP OLS 


The oscillator and VCO outputs are mixed and filtered to obtain the difference- 
frequency signal cos [6,(t) — (vt + ¢,)] applied to the phase discriminator. Under 
locked conditions with e, = 0, the instantaneous angles at the input to the discrimi- 
nator will differ by 90°. Hence, 6,(f) — (wt + $4) = wt + bp + 90°, and the VCO 
produces cos [(w, + w,)t + dp + $, + 90°]. 

By likewise equating instantaneous angles, you can confirm that Fig. 7.3—6 per- 
forms frequency multiplication. Like the frequency multiplier discussed in Sect. 5.2, 
this unit multiplies the instantaneous angle of the input by a factor of n. However, it 
does so with the help of a frequency divider which is easily implemented using a 
digital counter. Commercially available divide-by-n counters allow you to select 
any integer value for n from 1 to 10 or even higher. When such a counter is inserted 
in a PLL, you have an adjustable frequency multiplier. . 


cos (wt + bo) cos [(w, + w;)t + ho + $, + 90°] 
=", PD VCO 
fo =f, tf 


LPF 















cos (wt + o)) M 
cos 0, 


Mixer 


Figure 7.3-5 Frequency-offset loop. 


cos (wt + po) F— — —4 cos (nwt + npo + 190°) 
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Figure 7.3-6 PLL frequency multiplier. 
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A frequency synthesizer starts with the output of one crystal-controlled master 
oscillator; various other frequencies are synthesized therefrom by combinations of 
frequency division, multiplication, and translation. Thus, all resulting frequencies 
are stabilized by and synchronized with the master oscillator. General-purpose labo- 
ratory synthesizers incorporate additional refinements and have rather complicated 
diagrams. So we'll illustrate the principles of frequency synthesis by an example. 


Suppose a double-conversion SSB receiver needs fixed LO frequencies at 100 kHz 
(for synchronous detection) and 1.6 MHz (for the second mixer), and an adjustable 
LO that covers 9.90-9.99 MHz in steps of 0.01 MHz (for RF tuning). The custom- 
tailored synthesizer in Fig. 7.3—7 provides all the required frequencies by dividing 
down, multiplying up, and mixing with the output of a 10-MHz oscillator. You can 
quickly check out the system by putting-a frequency-multiplication block in place of 
each PLL with a divider. 

Observe here that all output frequencies are less than the master-oscillator fre- 
quency. This ensures that any frequency drift will be reduced rather than increased 
by the synthesis operations. 


(10 ~0.01n) 
MHz 





0.1 MHz 





0.01 MHz 


0.01n MHz 





Frequency synthesizer with fixed and adjustable outputs. 


Draw the block diagram of a PLL system that synthesizes the output frequency nf m 
from a master-oscillator frequency f.. State the condition for locked operation in 
terms of the loop gain K and the VCO free-running frequency f,. 
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Linearized PLL Models and FM Detection 


Suppose that a PLL has been tuned to lock with the input frequency f,, so Af = 0. 
Suppose further that the PLL has sufficient loop gain to track the input phase $(t) 
within a small error e(t), so sin e(t) ^ e(t) = f(t) — d, (n). These suppositions con- 
stitute the basis for the linearized PLL model in Fig. 7.3-8a, where the LPF has been 
represented by its impulse response A(t). 

Since we'll now focus on the phase variations, we view (2) as the input “sig- 
nal" which is compared with the feedback "signal" 


t 


b(t) = 27K, | y) d 


to produce the output y(t). We emphasize that viewpoint by redrawing the linearized 
model as a negative feedback system, Fig. 7.3-8b. Note that the VCO becomes an 
integrator with gain 27rK,, while phase comparison becomes subtraction. 

Fourier transformation finally takes us to the frequency-domain model in 
Fig. 7.3-8c, where D(f) = FIED], H(f) = FIRA], and so forth. Routine analysis 
yields 

KHU) — gis = L HKA) 
1+ Ka H(£(K,/jf) K, jf + KH(f) 





YF) = D(f) [10] 


xf) 2 2 cos [wt + $A] 


> 


v(t) = cos [wet  $, (t) + 90°] 







y(t) 


(a) 


P(t) + " —— > x0 
t 
dul) = 














(b) 
XP i ———- Hf) > Y(f) 
of) Klif 











{c} 


Figure 7.3-8 Linearized PLL models. (a) Time domain; (b] phase; [c] frequency domain. 
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which expresses the frequency-domain relationship between the input phase and 
output voltage. 


Now let x,(t) be an FM wave with $(t) = 27f, x(t) and, accordingly, 


D) = 2nfa XSG) = f/f )X(f) 
Substituting for P(f) in Eq. (10) gives 


iu) = É ny ma 


where 


H(f) 
H(f) + j(f/K) 


which we interpret as the equivalent loop transfer function. If X(f) has message 
bandwidth W and if 


Hf) = [11b] 


A(f) =1 f| = Ww [12a] 
then H,(f) takes the form of a first-order lowpass filter with 3-dB bandwidth K, namely 
1 
H E TA sW [12b] 
Thus, Y(f) ~ FK X(f) when K = W so 
y(t) = A y [13] 


Under these conditions, the PLL recovers the message x(t) from x,(f) and thereby 
serves as an FM detector. 

A disadvantage of the first-order PLL with H(f) = 1 is that the loop gain K 
determines both the bandwidth of H;( f) and the lock-in frequency range. In order to 
track the instantaneous input frequency f(t) = f, + fAx(t) we must have K = fa. The 
large bandwidth of H;(f) may then result in excessive interference and noise at the 
demodulated output. For this reason, and other considerations, H;(f) is usually a 
more sophisticated second-order function in practical PLL frequency detectors. 


7.4 TELEVISION SYSTEMS 


The message transmitted by a television is a two-dimensional image with motion, 
and therefore a function of two spatial variables as well as time. This section intro- 
duces the theory and practice of image transmission via an electrical signal. Our ini- 
tial discussion of monochrome (black and white) video signals and bandwidth 
requirements also applies to facsimile systems which transmit only still pictures. 
Then we'll describe TV transmitters, in block-diagram form, and the modifications 
needed for color television. 





7.4 Television Systems 


There are several types of television systems with numerous variations found in 
different countries. We'll concentrate on the NTSC (National Television System 
Committee) system used in North America, South America, and Japan and its digi- 
tal replacement, the HDTV (high-definition television). More details about HDTV 
are given by Whitaker (1999), and ATSC (1995). 


Video Signals, Resolution, and Bandwidth 


To start with the simplest case, consider a motion-free monochrome intensity pat- 
tern /(h, v ), where h and v are the horizontal and vertical coordinates. Converting 
K(h, v) to a signal x(t)—and vice versa—requires a discontinuous mapping process 
such as the scanning raster diagrammed in Fig. 7.4—1. The scanning device, which 
produces a voltage or current proportional to intensity, starts at point A and moves 
with constant but unequal rates in the horizontal and vertical directions, following 
the path AB. Thus, if s, and s, are the horizontal and vertical scanning speeds, the . 
output of the scanner is the video signal 


x(t) = I(s, t, s, 1) [1] 


since h = s,t, and so forth. Upon reaching point B, the scanning spot quickly flies 
back to C (the horizontal retrace) and proceeds similarly to point D, where facsimile 
scanning would end. 

In TV, however, image motion must be accommodated, so the spot retraces ver- 
tically to E and follows an interlaced pattern ending at F. The process is then 
repeated starting again at A. The two sets of lines are called the first and second 
fields; together they constitute one complete picture or frame. The frame rate is just 
rapid enough (25 to 30 per second) to create the illusion of continuous motion, while 
the field rate (twice the frame rate) makes the flickering imperceptible to the human 
eye. Hence, interlaced scanning allows the lowest possible picture repetition rate 
without visible flicker. 











Figure 7.4-1 Scanning raster with two fields (line spacing grossly exaggerated). 
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Figure 7.4-2 Video waveform for one full line (NTSC standards]. 


Two modifications are made to the video signal after scanning: blanking pulses 
are inserted during the retrace intervals to blank out retrace lines on the receiving 
picture tube; and synchronizing pulses are added on top of the blanking pulses to 
synchronize the receiver's horizontal and vertical sweep circuits. Figure 7.4—2 
shows the waveform for one complete line, with amplitude levels and durations cor- 
responding to NTSC standards. Other parameters are listed in Table 7.4—1 along 
with some comparable values for the European CCIR (International Radio Consul- 
tative Committee) system and the high-definition (HDTV) system. 











Table7.4-1 . Television system parameters 
NISC .— CCIR HDTV/USA 

Aspect ratio, horizontal/vertical 4/3 4/3 16/9 

Total of lines per frame 525 625 1125 

Field frequency, Hz 60 50 60 

Line frequency, kHz 15.75 15.625 33.75 

Line time, us 63.5 64 29.63 

Video bandwidth, MHz 4.2 5.0 24.9 

Optimal viewing distance 7H 7H 3H 

Sound Mono/Stereo Mono/Stereo 6 channel Dolby 
output output Digital Surround 

Horizontal retrace time, uS 10 3.7 

Vertical retrace, lines/field 21 45 





Analyzing the spectrum of the video signal in absence of motion is relatively 
easy with the aid of Fig. 7.4—3 where, instead of retraced scanning, the image has 
been periodically repeated in both directions so the equivalent scanning path is 
unbroken. Now any periodic function of two variables may be expanded as a two- 
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7.4 Television Systems 

















Figure 7.4-3 


Periodically repeated image with unbroken scanning path. 


dimensional Fourier series by straightforward extension of the one-dimensional 


series. For the case at hand with H and V the horizontal and vertical periods (includ- 
ing retrace allowance), the image intensity is 


oye >. e285 (2 + z) | v 


m=—co n=- H V 
where 
pope h 
EI | | Kh, v) exp [os (I + zar dh dv [3] 
Therefore, letting 
Sh pe 
Sh H Ío Vy 


and using Eqs. (1) and (2), we obtain 


oo co 
x(t) = W VW ejm 


m-- -con--co 


[4] 


This expression represents a doubly periodic signal containing all harmonics of the 
line frequency f, and the field frequency f,, plus their sums and differences. Since 
fn 7? f, and since |c,,,| generally decreases as the product mn increases, the ampli- 
tude spectrum has the form shown in Fig. 7.4—4, where the spectral lines cluster 
around the harmonics of f, and there are large gaps between clusters. 
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Figure 7.4-4 Video spectrum for still image. 


Equation (4) and Fig. 7.4—4 are exact for a still picture, as in facsumile systems. 
When the image has motion, the spectral lines merge into continuous clumps around 
the harmonics of f;. Even so, the spectrum remains mostly "empty" everywhere else, 
a property used to advantage in the subsequent development of color TV. Despite the 
gaps in Fig. 7.4—4, the video spectrum theoretically extends indefinitely—similar to 
an FM line spectrum. Determining the bandwidth required for a video signal thus 
involves additional considerations. 

Two basic facts stand in the way of perfect image reproduction: (1) there can be 
only a finite number of lines in the scanning raster, which limits the image clarity or 
resolution in the vertical direction; and (2) the video signal must be transmitted with 
a finite bandwidth, which limits horizontal resolution. Quantitatively, we measure 
resolution in terms of the maximum number of discrete image lines that can be dis- 
tinguished in each direction, say n, and n,. In other words, the most detailed image 

that can be resolved is taken to be a checkerboard pattern having n, columns and n, 
rows. We usually desire equal horizontal and vertical resolution in lines per unit dis- 
tance, so n/H = n/V and 


[5] 


which is called the aspect ratio. 

Clearly, vertical resolution is related to the total number of raster lines N; 
indeed, n, equals N if all scanning lines are active in image formation (as in facsim- 
ile but not TV) and the raster aligns perfectly with the rows of the image. Experi- 
mental studies show that arbitrary raster alignment reduces the effective resolution 
by a factor of about 70 percent, called the Kerr factor, so 


n, = 0.7(N — N,,) [6] 


where N,, is the number of raster lines lost during vertical retrace. 
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Horizontal resolution is determined by the baseband bandwidth B allotted to the 
video signal. If the video signal is a sinusoid at frequency fax = B, the resulting pic- 
ture will be a sequence of alternating dark and light spots spaced by one-half cycle 
in the horizontal direction. It then follows that 


ny, = 2B(Tine — Thr) [7] 
Where Tine is the total duration of one line and T, is the horizontal retrace time. 
Solving Eq. (7) for B and using Eas. (5) and (6) yields 
H/V)n, N-N, 
a haa = T 
2(Tiine = Thr) Tine 7 Thr 


Another, more versatile bandwidth expression is obtained by multiplying both sides 
of Eq. (8) by the frame time Thame = NTine and explicitly showing the desired reso- 
lution. Since N = n,/0.7(1 — N,,/N), this results in 


Bs 0.714n, ae 
Tame = 9a 
i ( - i) ( - zt) 
N Tine 
where 
n, = Sni = nan, [9b] 


V 


The parameter n, represents the number of picture elements or pixels. Equation (9) 
brings out the fact that the bandwidth (or frame time) requirement increases in pro- 
portion to the number of pixels or as the square of the vertical resolution. 





The NTSC system has N — 525 and N,, — 2 X 2] — 42 so there are 483 active lines. 
The line time is Tine = 1/f, = 63.5 us and T,, = 10 us, leaving an active line time of 
53.5 us. Therefore, using Eq. (8) with H/V = 4/3, we get the video bandwidth 


4 483 
B = 0.35 X = Xx —————— 
3 53.5 x 107° 


EXAMPLE 7.4-1] 
= 4.2 MHz 


This bandwidth is sufficiently large to reproduce the 5-us sync pulses with reason- 
ably square corners. 





EXERCISE 7.4-1 


Facsimile systems require no vertical retrace and the horizontal retrace time is neg- 
ligible. Calculate the time Tame needed for facsimile transmission of a newspaper 
page, 37 by 59 cm, with a resolution of 40 lines/cm using a voice telephone channel 
with B = 3.2 kHz. 
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Monochrome Transmitters and Receivers 


The large bandwidth and significant low-frequency content of the video signal, 
together with the desired simplicity of envelope detection, have led to the selection 
of VSB + C (as described in Sect. 4.4) for TV broadcasting in the United States. 
However, since precise vestigial sideband shaping is more easily carried out at the 
receiver where the power levels are small, the actual modulated-signal spectrum is 
as indicated in Fig. 7.4—5a. The half-power frequency of the upper sideband is about 
4.2 MHz above the video carrier fẹ while the lower sideband,has a 1-MHz band- 
width. Figure 7.4—5b5 shows the frequency shaping at the receiver. 

The audio signal is frequency-modulated on a separate carrier fj, = fy + fas 
with f, = 4.5 MHz and frequency deviation fa = 25 kHz. Thus, assuming an audio 
bandwidth of 10 kHz, D = 2.5 and the modulated audio occupies about 80 kHz. TV 
channels are spaced by 6 MHz, leaving a 250-kHz guard band. Carrier frequencies 
are assigned in the VHF ranges 54-72, 76-88, and 174—216 MHz, and in the UHF 
range 470—806 MHz. 

The essential parts of a TV transmitter are block-diagrammed in Fig. 7.4—6. The 
synchronizing generator controls the scanning raster and supplies blanking and sync 
pulses for the video signal. The dc restorer and white clipper working together 
ensure that the amplified video signal levels are in proportion. The video modulator 
is of the high-level AM type with = 0.875, and the power amplifier removes the 
lower portion of the lower sideband. 

The antenna has a balance-bridge configuration such that the outputs of the 
audio and video transmitters are radiated by the same antenna without interfering 
with each other. The transmitted audio power is 50 to 70 percent of the video power. 

As indicated in Fig. 7.4-7, a TV receiver is of the superheterodyne type. The 
main IF amplifier has f; in the 41- to 46-MHz range and provides the vestigial shap- 
ing per Fig. 7.4—5b. Note that the modulated audio signal is also passed by this 


Video carrier Audio carrier 
f- f, MHz 
-125 -0.75 0 4.0 4.5 4.75 
(a) 
f- fa, MHz 
-0.75 0 0.75 4.5 4.75 
(bi 


Figure 7.4-5 (a) Transmitted TV spectrum; {b} VSB shaping at receiver. 
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Figure 7.4-6 Monochrome TV transmitter. 
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Figure 7.4-7 Monochrome TV receiver. 


amplifier, but with substantially less gain. Thus, drawing upon Eq. (11), Sect. 4.4, 
the total signal at the input to the envelope detector is 


y(t) = All + nx(t)] cos out — Ad«uxét) sin ot [10] 


where x(f) is the video signal, $(f) is the FM audio, and w, = 27 fa. Since |ux,(?)| «€ 1 
and Aca << A,,, the resulting envelope is approximately 


A(t) = Aall + ux(t)] + Aa cos [wat + P(t) ] [11] 


which gives the signal at the output of the envelope detector. 

The video amplifier has a lowpass filter that removes the audio component from 
A(t) as well as a dc restorer that electronically clamps the blanking pulses and 
thereby restores the correct dc level to the video signal. The amplified and dc- 
restored video signal is applied to the picture tube and to a sync-pulse separator that 
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provides synchronization for the sweep generators. The "brightness" control permits 
manual adjustment of the DC level while the "contrast" control adjusts the gain of 
the IF amplifier. 

Equation (11) shows that the envelope detector output also includes the modu- 
lated audio. This component is picked out and amplified by another IF amplifier 
tuned to 4.5 MHz. FM detection and amplification then yields the audio signal. 

Observe that, although the transmitted composite audio and video signal is a 
type of frequency-division multiplexing, separate frequency conversion is not 
required for the audio. This 1s because the video carrier acts like a local oscillator for 
the audio in the envelope-detection process, an arrangement called the intercarrier- 
sound system having the advantageous feature that the audio and video are always 
tuned in together. Successful operation depends on the fact that the video compo- 
nent 1s large compared to the audio at the envelope detector input, as made possible 
by the white clipper at the transmitter (which prevents the modulated video signal 
from becoming too small) and the relative attenuation of the audio by the receiver's 
IF response. 

Some additional features not shown on our transmitter and receiver diagrams 
relate to the vertical retrace interval. The NTSC system allots 21 lines per field to 
vertical retracing, or about 1.3 ms every 1/60 sec. The first 9 lines carry control 
pulses, but the remaining 12 may be utilized for other purposes while the retrace 
goes on. Applications of these available lines include: the vertical-interval test signal 

(VITS) for checking transmission quality; the vertical-interval reference (VIR) for 
receiver servicing and/or automatic adjustments; and digital signals that generate 
the closed-captioning characters on special receivers for the hearing impaired. 





Use a phasor diagram to derive Eq. (11) from Eq. (10). 





Color "Television 


Any color can be synthesized from a mixture of the three additive primary colors, 
red, green, and blue. Accordingly, a brute-force approach to color TV would involve 
direct transmission of three video signals, say x,(t), xo(t), and xg(t)—one for each 
primary. But, aside from the increased bandwidth requirement, this method would 
not be compatible with existing monochrome systems. A fully compatible color TV 
signal that fits into the monochrome channel was developed in 1954, drawing upon 
certain characteristics of human color perception. The salient features of that system 
are outlined here. 

To begin with, the three primary color signals can be uniquely represented by 
any three other signals that are independent linear combinations of xg(t), xg(t), and 
x,(t). And, by proper choice of coefficients, one of the linear combinations can be 
made the same as the intensity or luminance signal of monochrome TV. In particu- 
Jar, it turns out that if 


bine Hee 


mem 


Lu. 


j 
| 





dt; 


As, 


iai ut 
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xy(t) = 0.30xg(t) + 0.59xc(t) + O.11xg(r) [12a] 


then x,(?) is virtually identical to the conventional video signal previously symbol- 
ized by x(t). The remaining two signals, called the chrominance signals, are taken as 


Here, the color signals are normalized such that 0 = x(t) = 1, and so forth, so the 
luminance signal is never negative while the chrominance signals are bipolar. 


Understanding the chrominance signals is enhanced by introducing the color ' 


vector 
xc(t) = x(t) + jxg(t) [13] 


whose magnitude lx,()| is the color intensity or saturation and whose angle arg x,(t) is 
the hue. Figure 7.4—8 shows the vector positions of the saturated primary colors in the 
IQ plane. A partially saturated (pastel) blue-green, for instance, might have x, = 0 and 
Xg = xg = 0.5, so xc = —0.300 — j0.105, lxc| = 0.318, and arg xc = —160°. Since 
the origin of the JQ plane represents the absence of color, the luminance signal may 
be viewed as a vector perpendicular to this plane. 

Because xy(f) serves as the monochrome signal, it must be alloted the entire 4.2- 
MHz baseband bandwidth to provide adequate horizontal resolution. Consequently, 
there would seem to be no room for the chrominance signals. Recall, however, that 
the spectrum of x,(t) has periodic gaps between the harmonics of the line frequency 
f, —and the same holds for the chrominance signals. Moreover, subjective tests have 
shown that the human eye is less perceptive of chrominance resolution than lumi- 
nance resolution, so that x(t) and xo(r) can be restricted to about 1.5 MHz and 
0.5 MHz, respectively, without significant visible degradation of the color picture. 


Red 





Green 


Figure 7.4-8 Saturated primary color vectors in the IQ plane. 


295 


296 


CHAPTER7 € Analog Communication Systems 


Combining these factors permits multiplexing the chrominance signals in an inter- 
leaved fashion in the baseband spectrum of the luminance signal. 

The chrominance signals are multiplexed on a color subcarrier whose fre- 
quency falls exactly halfway between the 227th and 228th harmonic of fp, namely, 


455 
i= A z 3.6 MHz [14] 


Therefore, by extension of Fig. 7.4-3, the luminance and chrominance frequency 
components are interleaved as indicated in Fig. 7.4—9a, and there is 0.6 MHz 
between f,, and the upper end of the baseband channel. The subcarrier modulation 
will be described shortly, after we examine frequency interleaving and compatibility. 

What happens when a color signal is applied to a monochrome picture tube? 
Nothing, surprisingly, as far as the viewer sees. True, the color subcarrier and its 
sidebands produce sinusoidal variations on top of the luminance signal. But because 
all of these sinusoids are exactly an odd multiple of one-half the line frequency, they 
reverse in phase from line to line and from field to field —illustrated by Fig. 7.4—9b. 
This produces flickering in small areas that averages out over time and space to the 
correct luminance value and goes essentially unnoticed by the viewer. 





0 fi l 2275, — f, — 228f, 


la) 








Figure 7.4-9 (a) Chrominance spectral lines [dashed] interleaved between luminance lines; 
(b) line-to-line phase reversal of chrominance variations on luminance. 
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By means of this averaging effect, frequency interleaving renders the color sig- 
nal compatible with an unmodified monochrome receiver. It also simplifies the 
design of color receivers since, reversing the above argument, the luminance signal 
does not visibly interfere with the chrominance signals. There is a minor interfer- 
ence problem caused by the difference frequency f, — f.. between the audio and 
color subcarriers. That problem was solved by slightly changing the line frequency 
to f, = f/286 = 15.73426 kHz giving f, — fse = 4500 — 3,579.545 = 920.455 kHz 
— (107/2)f, which 1s an "invisible" frequency. (As a result of this change, the field 
rate is actually 59.94 Hz rather than 60 Hz!) 

A modified version of quadrature-carrier multiplexing puts both chrominance 
signals on the color subcarrier. Figure 7.4—10 shows how the luminance and chromi- 
nance signals are combined to form the baseband signal x;(/) in a color transmitter. 
Not shown is the nonlinear gamma correction introduced at the camera output to 
compensate for the brightness distortion of color picture tubes. 

The gamma-corrected color signals are first matrixed to obtain xy(£), x,(t), and 
Xg(t) in accordance with Eq. (12). Next, the chrominance signals are lowpass fil- 
tered (with different bandwidths) and applied to the subcarrier modulators. Subse- 
quent bandpass filtering produces conventional DSB modulation for the Q channel 
and modified VSB for the J channel—for example, DSB for baseband frequencies 
of x(t) below 0.5 MHz and LSSB for 0.5 < |f| < 1.5 MHz. The latter keeps the 
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Figure 7.4-10 Color subcarrier modulation system. 
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modulated chrominance signals as high as possible in the baseband spectrum, 
thereby confining the flicker to small areas, while still allowing enough bandwidth 
for proper resolution of x,(t). Total sideband suppression cannot be used owing to 
the significant low-frequency content in x,(t) and xo(t). 

Including x(t), the entire baseband signal becomes 


x,(t) = xy(t) + xolt) sin ot + x(t) cos out + X(t) sin ot [15] 


where X;4(t) is the Hilbert transform of the high-frequency portion of x(t) and 
accounts for the asymmetric sidebands. This baseband signal takes the place of the 
monochrome video signal in Fig. 7.4-6. Additionally, an 8-cycle piece of the color 
subcarrier known as the color burst is put on the trailing portion or “back porch" of 
the blanking pulses for purposes of synchronization. 

Demultiplexing is accomplished in a color TV receiver after the envelope detec- 
tor, as laid out in Fig. 7.4—11. Since the luminance signal is at baseband here, it 
requires no further processing save for amplification and a 3.6-MHz trap or rejection 
filter to eliminate the major flicker component; the chrominance sidebands need not 
be removed, thanks to frequency interleaving. The chrominance signals pass 
through a bandpass amplifier and are applied to a pair of synchronous detectors 
whose local oscillator is the VCO in a PLL synchronized by phase comparison with 
the received color burst. Manual controls usually labeled ‘‘color level” (i.e., satura- 


LPF 
0.5 MHz 


-— — Tint 





Figure 7.4-11 Color demodulation system. 
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tion) and “tint” (i.e., hue) are provided to adjust the gain of the chrominance ampli- 
fier and the phase of the VCO; their effect on the picture is readily explained in 
terms of the color vector and Fig. 7.4-8. 

Assuming good synchronization, it follows from Eq. (15) that the detected but 
unfiltered J- and Q-channel signals are proportional to 


v(t) = x(t) + 2xyg(t) cos wet + x(t) cos 2w,,t [16a] 
+ [xo(t) + ilt) ] sin 2w,,t 


Volt) = xolt) + Xy(t) + 2xyy(t) sin out + x,(t) sin 2w,,t [165] 
— [xo(t) + Xy,(t)] cos Zwet 


where xyj(t) represents the luminance frequency components in the 2.1- to 4.1-MHz 
range. Clearly, lowpass filtering will remove the double-frequency terms, while the 
terms involving xy;,(t) are "invisible" frequencies. Furthermore, %;;,(¢) in Eq. (16b) 
has no components less than 0.5 MHz, so itis rejected by the LPF in the Q channel. 
(Imperfect filtering here results in a bothersome effect called quadrature color cross 
talk). Therefore, ignoring the invisible-frequency terms, x(t) and x(t) have been 
recovered and can then be matrixed with x(t) to generate the color signals for the 
picture tube. Specifically, by inversion of Eq. (12), 


xa(t) = xy(t) + 0.95x,(t) + O.62xo(t) [17] 
xg(t) = xy(t) — 0.28x,(t) — 0.64x9(t) 
xg(t) = xy(t) — 1.10x;(t) + 1.70x,(t) 


If the received signal happens to be monochrome, then the three color signals will be 
equal and the reproduced picture will be black-and-white. This is termed reverse 
compatibility. 

The NTSC color system described here certainly ranks high as an extraordinary 
engineering achievement! It solved the problems of color reproduction with direct and 


reverse monochrome compatibility while staying within the confines of the 6-MHz 
channel allocation. 


HDTV' 


The tremendous advances in digital technology combined with consumer demand for 
better picture and sound quality, plus computer compatibility, has motivated televi- 
sion manufacturers to develop a new US color TV standard: high-definition televi- 
sion (HDTV). A digital standard provides multimedia options such as special effects, 
editing, and so forth, and better computer interfacing. The HDTV standard supports 
at least 18 different formats and is a significant advancement over NTSC with respect 
to TV quality. One of the HDTV standards is shown in Table 7.4-1. First, with 


1Joào O. P. Pinto drafted this section. 
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respect to the NTSC system, the number of vertical and horizontal lines has doubled, 
and thus the picture resolution is four times greater. Second, the aspect ratio has been 
changed from 4/3 to 16/9. Third, as Figs. 7.4-12 and 7.4—13 indicate, HDTV has 
improved scene capture and viewing angle features. For example, with H equal to the 
TV screen height and with a viewing distance of 10 feet (7H) in the NTSC system, 
the viewing angle is approximately 10 degrees. Whereas with HDTV, the same 10 
foot viewing distance (3H) yields a viewing angle of approximately 20 degrees. 

HDTV has also adopted the AC-3 surround sound system instead of mono- 
phonic or stereo sound. This system has six channels: right, right surround, left, left 
surround, center, and low-frequency effect (LFE). The LFE channel has only a band- 
width of 120 Hz, effectively providing only 5.1 channels. 

HDTV can achieve a given signal-to-noise ratio with 12 dB less radiated power 
than NTSC-TV. Thus, for the same transmitter power, reception that was marginal 
with NTSC broadcasts will be greatly improved with HDTV. 

Although there was no attempt to make HDTV broadcast signals compatible with 
existing NTSC TV receivers, by 2006 the FCC will require that only digital signals be 
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Figure 7.4-12 Scene capabilities of conventional NTSC system and HDTV. 
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Figure 7.4-13 Viewing angles as a function of distance. (a) Conventional NTSC; (b) HDTV. 


broadcast, of which HDTV is one kind. Thus, in order to receive TV broadcasts, exist- 
ing TV sets will have to be replaced or modified by some type of converter. 

The system for encoding and transmitting HDTV signals is shown in Figure 7.4—14. 
The transmitter consists of several stages. First, the 24.9-MHz video signal and corre- 
sponding audio signals are compressed, so they will fit into the allocated 6-MHz channel 
bandwidth. The compressed audio and video data is then combined with ancillary data 
that includes control data, closed captioning, and so forth, using a multiplexer. The mul- 
tiplexer then formats the data into packets. Next, the packetized data is scrambled to 
remove any undesirable frequency discrete components, and is channel encoded. During 
channel coding the data is encoded with check or parity symbols using Reed-Solomon 
coding to enable error correction at the receiver. The symbols are interleaved to mini- 
mize the effects of burst-type errors where noise in the channel can cause successive 
symbols to be corrupted. Finally, the symbols are Trellis-Code Modulated (TCM). 
TCM, which will be discussed in Chapter 14, combines coding and modulation and 
makes it possible to increase the symbol transmission rate without an increase in error 
probability. The encoded data is combined with synchronization signals and is then 
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Figure 7.4-14 HDTV transmitter block diagram. 


8VSB modulated. 8VSB is a VSB technique where an 8-level baseband code is VSB 
modulated onto a given carrier frequency. 

The HDTV receiver shown in Figure 7.4—15 reverses the above process. As 
broadcasters and listeners make the transition from NTSC-TV to HDTV, they will 
be allowed to transmit both signals simultaneously. To overcome potential interfer- 
ence, the HDTV receiver uses the NTSC rejection filter to reject NTSC signals. A 
channel equalizer/ghost canceller stage, not shown, performs ghost cancellation and 
channel equalization. The phase tracker minimizes the effects of phase noise caused 
by the system's PLL. 

When digitized, the 24.9-MHz video signal has a bit rate of 1 Gbps, whereas a 
6-MHBz television channel can only accommodate 20 Mbps. Therefore a compres- 
sion ratio of more than 50:1 is required. The raw video signal obtained by the scan- 
ning process contains significant temporal and spatial redundancies. These are taken 
advantage of during the compression process. During the transmission of each 
frame, only those parts in the scene that move or change are actually transmitted. 
The specific compression process is the MPEG-2 (Motion Picture Expert Group-2), 
which uses the Discrete Cosine Transform (DCT). See Gonzalez and Woods (1992) 
for more information on the DCT. The MPEG-2 signals are readily interfaced to 
computers for multimedia capability. 
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Figure 7.4-15 HDTV receiver block diagram. 
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7.5 PROBLEMS 


Suppose a commercial AM superhet has been designed such that the image fre- 
quency always falls above the broadcast band. Find the minimum value of fj, the 
corresponding range of f; o, and the bounds on Bar. 


Suppose a commercial FM superhet has been designed such that the image fre- 
quency always falls below the broadcast band. Find the minimum value of frr, the 
corresponding range of fro, and the bounds on Bar. 


Suppose a commercial AM superhet has fp = 455 kHz and fro = 1/27 VLC, 
where L = 1 uH and C is a variable capacitor for tuning. Find the range of C when 
fio = fe + fir and when fro = fe — fir- 


Suppose the RF stage of a commercial AM superhet is a tuned circuit like Fig. 4.1-7 


with L = 1 uH and variable C for tuning. Find the range of C and the corresponding 
bounds on R. 


304 


7.1-5 


7.1-6 
7.1-7 


7.1-8 


7.1-9* 


7.1-10 


7.1-11 


7.1-12* 


7.1-13 


7.1-14* 


CHAPTER7 @ Analog Communication Systems 


Consider a superhet intended for USSB modulation with W = 4 kHz and f. = 
3.57-3.63 MHz. Take fro = fe + fig and choose the receiver parameters so that all 
bandpass stages have B/fy ~ 0.02. Then sketch |Hg-(f)| to show that the RF stage 
can be fixed-tuned. Also sketch |H,;(f)|, accounting for sideband reversal. 


Do Prob. 7.1—5 for LSSB modulation with W = 6 kHz and f. = 7.14-7.26 MHz. 


Sketch the spectrum of x(t) X cos 27f,ot to demonstrate the sideband-reversal 
effect in an SSB superhet when fro = f. + fir. 


For automatic frequency control in an FM superhet, the LO is replaced by a VCO 
that generates Ajo cos O(t) with 6(t) = 2m[f. — fie + Ku(t) + e(t] where e(?) is a 
slow and random frequency drift. The control voltage v(^ is derived by applying the 
demodulated signal to an LPF with B < W. The demodulated signal is yp(t) = 
Kpó s(t)/2@ where ¢;p(t) is the instantaneous phase at the IF output. Analyze this 
AFC system by finding yp(t) in terms of x(t) and e(7). 


Consider a superhet that receives signals in the 50-54 MHz range with fro = fe + fir. 
Assuming there is little filtering prior to the mixer, what range of input signals will 
be received if the fj, is: (a) 455 KHz, (b) 7 MHz? 


Design a receiver that will receive USSB signals in the 50-54 MHz range where 
fie = 100 MHz and does not exhibit sideband-reversal. Assume there is little filtering 
prior to the mixer. Specify f;o, the product detector oscillator frequency, the center 
frequency of the IF bandpass filter, and any image frequencies that will be received. 


Consider a superhet with fro = f. + fig, fir = 455 kHz, and f. = 2 MHz. The RF 
amplifier is preceded by a first-order RLC bandpass filter with f; = 2 MHz and B = 
0.5 MHz. Assume the IF-BPF is nearly ideal and that the mixer has unity gain. What 
is the minimum spurious frequency input rejection ratio in dB? 


Suppose the receiver of Prob. 7.1-11 has a LO with a second harmonic whose volt- 
age level 1s half that of the fundamental component. (a) What input frequencies will 
be accepted, and at what power level in dB as compared to the correct input? (b) Dis- 
cuss all ways to minimize these interfering inputs. 


Consider a superhet that receives signals in the 7.0 to 8.0 MHz range with f; o = f. + 
fig, and fre = 455 KHz. The receiver's RF amplifier has a passband of 2 MHz, and its 
IF-BPF is nearly ideal and has a bandwidth of 3 kHz. Design a frequency converter 
that has a fixed LO frequency that will enable the reception of 50.0- to 51.0-MHz 
signals. Assume the converter's RF amplifier is relatively wideband. (a) If the 
incoming frequency is supposed to be f. = 50 MHz, what other spurious frequencies 


will this receiver respond to? (b) Describe how to minimize these spurious 
responses. 


What is the image rejection performance of a single conversion superhet receiver 
that receives signals in the 50-54 MHz range, fro > f,, and has an RF amplifier 


that includes a fixed frequency RLC-BPF with B = 4 MHz with (a) fie = 20 MHz, 
(b) fic = 100 MHz? 
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Design a superhet receiver for a dual-mode cellular phone system that will accept 
either 850 MHz analog cellular signals or 1900 MHz digital personal communica- 
tions systems (PCS) signals. Specify the F,o, Fır, and image frequencies. 


Find suitable parameters of a double-conversion receiver having RR — 60 dB and 
intended for DSB modulation with W = 10 kHz and f. = 4 MHz. 


A double conversion receiver designed for f. = 300 MHz has fr- = 30 MHz and 
fir-2 = 3 MHz, and each LO frequency is set at the higher of the two possible val- 
ues. Insufficient filtering by the RF and first IP stages results in interference from 
three image frequencies. What are they? 


Do Prob. 7.1-17 with each LO frequency set at the lower of the two possible values. 


Specify the settings on a scanning spectrum analyzer to display the spectrum up to 
the 10th harmonic of a signal with a 50-ms period. 


Specify the settings on a scanning spectrum analyzer to display the spectrum of a 
tone-modulated FM signal with f. = 100 KHz, fa = 1 kHz, and B = 5. 


The magnitude spectrum of an energy signal u(t) can be displayed by multiplying 
v(t) with the swept-frequency wave cos (w,t — at?) and applying the product to a 
bandpass filter having Aj, (t) = cos (w,t + at”). Use equivalent lowpass time-domain 
analysis to show that hg,(t) = lgia? and that the envelope of the bandpass output is 
proportional to |V(f)| with f = at/a. 


Four signals, each having W = 3 kHz, are to be multiplexed with 1-kHz guard bands 
between channels. The subcarrier modulation is USSB, except for the lowest chan- 
nel which is unmodulated, and the carrier modulation is AM. Sketch the spectrum of 
the baseband and transmitted signal, and calculate the transmission bandwidth. 


Do Prob. 7.2-1 with AM subcarrier modulation. 


Let f; be an arbitrary carrier in an FDM signal. Use frequency-translation sketches to 
show that the BPFs in Fig. 7.22 are not necessary if the subcarrier modulation is 
DSB and the detector includes an LPF. Then show that the BPFs are needed, in gen- 
eral, for SSB subcarrier modulation. 


Ten signals with bandwidth W are to be multiplexed using SSB subcarrier modula- 
tion and a guard band B, between channels. The BPFs at the receiver have |H(f)| = 
exp {—[1.2(f — f9/ Wy), where fy equals the center frequency for each subcarrier 
signal. Find B, so that the adjacent-channel response satisfies |H(f)| 0.1. Then 
calculate the resulting transmission bandwidth of the FDM signal. 


Suppose the voice channels in a group signal have B, = 1 kHz and are separated at 
the receiver using BPFs with |H(f)| = (1 + (2(f — f9/BJ^) ?. Make a careful 
sketch of three adjacent channels in the group spectrum, taking account of the fact 
that a baseband voice signal has negligible content outside 200 « |f| « 3200 Hz. 


Use your sketch to determine values for B, fy, and n so that |H(f)| = 0.1 outside the 
desired passband. 
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Some FDM telemetry Systems employ proportional bandwidth FM subcarrier mod- 
ulation when tbe signals to be multiplexed have different bandwidths. All subcarrier 
signals have the same deviation ratio but the ith subcarrier frequency and message 
bandwidth are related by f, = W/c where a is a constant. (a) Show that the subcar- 
f; and obtain an expression for fi, in 
terms of f; to provide a guard band B, between channels. (b) Calculate the next three 
subcarrier frequencies when f, — 2 kHz, B, — 800 Hz, and B, = 400 Hz. 


Find the output signals of the quadrature-carrier system in Fig. 7.2-6 when the 
receiver local oscillator has a phase error $’. 


In one proposed system for FM quadraphonic multiplexing, the baseband signal in 
Fig. 7.2-4 is modified as follows: The unmodulated signal is xy(f) = Lp + Lg + Rp 
+ Rp (for monophonic compatibility), the 38-kHz subcarrier has quadrature-carrier 
multiplexing with modulating signals x,() and x(t), and the SCA signal is replaced 
by a 76-kHz subcarrier with DSB modulation by x3(t) = Le — Lg + Rp — Re. What 
should be the components of x,(#) for Stereophonic compatibility? Now consider 
XÀ X x,( + x4(f) to determine the components of x(t). Draw a block diagram of 
the corresponding transmitter and quadraphonic receiver. 


Suppose the transmission channel in Fig. 7.2-6 has linear distortion represented by 
the transfer function Hc(f). Find the resulting spectrum at the lower output and 
show that the condition for no cross talk is Af — f) = He + fà for |f| = W. If 
this condition holds, what must be done to recover x(t)? 


Twenty-four voice signals are to be transmitted via multiplexed PAM with a marker 
pulse for frame synchronization. The sampling frequency is 8 kHz and the TDM sig- 
nal has a 50 percent duty cycle. Calculate the signaling rate, pulse duration, and 
minimum transmission bandwidth. 


Do Prob. 7.2-10 with a 6-kHz sampling frequency and 30 percent duty cycle. 


Twenty signals, each with W = 4 kHz, are sampled at a rate that allows a 2-kHz 
guard band for reconstruction filtering. The multiplexed samples are transmitted on 
a CW carrier. Calculate the required transmission bandwidth when the modulation 
is: (a) PAM/AM with 25 percent duty cycle; (b) PAM/SSB with baseband filtering. 


Ten signals, each with W = 2 kHz, are sampled at a rate that allows a 1-kHz guard 
band for reconstruction filtering. The multiplexed samples are transmitted on a CW 
carrier. Calculate the required transmission bandwidth when the modulation is: (a) 
PPM/AM with 20 percent duty cycle; (b) PAM/FM with baseband filtering and f, = 
75 kHz. 


Given a 6-channel main multiplexer with f, = 8 kHz, devise a telemetry system sim- 
ilar to Fig. 7.2-11 (including a marker) that accommodates six input signals having 
the following bandwidths: 8.0, 3.5, 2.0, 1.8, 1.5, and 1.2 kHz. Make sure that suc- 
cessive samples of each input signal are equispaced in time. Calculate the resulting 
baseband bandwidth and compare with the minimum transmission bandwidth for an 
FDM-SSB system. 
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Do Prob. 7.2—14 for seven input signals having the following bandwidths: 12.0, 4.0, 
1.0, 0.9, 0.8, 0.5, and 0.3 kHz. 


Do Prob. 7.2—14 for eight input signals having the following bandwidths: 12.0, 3.5, 
2.0, 0.5, 0.4, 0.3, 0.2, and 0.1 KHz. 


Calculate the bandwidth required so the cross talk does not exceed —40 dB when 25 
voice signals are transmitted via PPM-TDM with f, = 8 kHz and tj = 7 = 0.2(T,/M). 


Find the maximum number of voice signals that can be transmitted via TDM-PPM 
with f, = 8 kHz and £j = 7 = 0.25(T/M) when the channel has B = 500 kHz and the 
cross talk is to be kept below —30 dB. 


Cross talk also occurs when a transmission system has inadequate low-frequency 
response, usually as a result of transformer coupling or blocking capacitors. Demon- 
strate this effect by sketching the pulse response of a high-pass filter whose step 
response is g(7) = exp (—27fet) u(t). Consider the extreme cases fpr << 1 and fer => 1. 


For one implementation of digital phase comparison, the switching circuit in Fig. 
7.3—1b has a set-reset flip-flop whose output becomes s(t) = + A after a positive- 
going zero-crossing of x,(f) and s(t) = —A after a positive-going zero-crossing of 
v(t). (a) Take x(t) = cos w,t and v(t) = cos (w,t — $,) and sketch one period of s(ġ 
for $, = 45, 135, 180, 225, and 315°. (b) Now plot y versus € = $, — 180? assum- 
ing that y(t) = <s(t)> . Note that this implementation requires + 180° phase differ- 
ence between the inputs for y = 0. 


Do part (a) of Prob. 7.3~1 for a digital phase comparator with a switch controlled by 
v(t) so its output is s(t) = A sgn x(t) when v(t) > 0 and s(t) = 0 when v( < 0. Now 
plot y versus € = $, —90° assuming that y(t) = <s(t)>. 

Consider a PLL in the steady state with €, < 1 fort < 0. The input frequency has a 


step change at t = 0, so f(t) = 27f,t for t > 0. Solve Eq. (5) to find and sketch e(t), 
assuming that K >> |Af + fi]. 


Explain why the Costas PLL system in Fig. 7.3-4 cannot be used for synchronous 
detection of SSB or VSB. 





Consider a PLL in steady-state locked conditions. If the external input is x,(f) = 
A, cos (w,t + 9), then the feedback signal to the phase comparator must be propor- 


tional to cos (ct + dy + 90° — e,). Use this property to find the VCO output in 
Fig. 7.3—5 when €, # 0. 


Use the property stated in Prob. 7.3—5 to find the VCO output in Fig. 7.3-6 when 
€, * O. 


Modify the FM stereo receiver in Fig. 7.2—5 to incorporate a PLL with f, ^ 38 kHz 
for the subcarrier. Also include a dc stereo indicator. 


Given a 100-kHz master oscillator and two adjustable divide-by-n counters with 
n = 1 to 10, devise a system that synthesizes any frequency from 1 kHz to 99 kHz in 
steps of 1 KHz. Specify the nominal free-running frequency of each VCO. 


308 


7.3-9 


7.3-10 


7.3-11 


7.3-12* 


7.3-13 


7.3-14t 


7.3-15t 


7.4-1 


7.4-2 


CHAPTER7. € Analog Communication Systems 


Referring to Table 7.1—1, devise a frequency synthesizer to generate fro = f. + fir 
for an FM radio. Assume you have available a master oscillator at 120.0 MHz and 
adjustable divide-by-n counters with n = 1 to 1000. 


Referring to Table 7.1-1, devise a frequency synthesizer to generate fro = f. + fir 
for an AM radio. Assume you have available a master oscillator at 2105 kHz and 
adjustable divide-by-n counters with n = 1 to 1000. 


The linearized PLL in Fig. 7.3-8 becomes a phase demodulator if we add an ideal 
integrator to get 


z(t) = | y(A) dA 


Find Z(f)/X(f) when the input is a PM signal. Compare with Eq. (11). 


Consider the PLL model in Fig. 7.3-8c, where E(f) = df) — ®,(f). (a) Find 
E(f)/®(f) and derive Eq. (10) therefrom. (b) Show that if the input is an FM signal, 
then E(f) = (f/KOH4GOXC) with Hf) = VIC) + jR). 


Suppose an FM detector is a linearized first-order PLL with H( f) = 1. Let the input 
signal be modulated by x(t) = Am cos 277f,,t where A,, = 1 and 0 =f, = W. (a) Use 
the relationship in Prob. 7.3-12b to find the steady-state amplitude of e(t). 
(b) Since linear operation requires |e(f)| = 0.5 rad, so sin e ^ e, show that the mini- 
mum loop gain is K = 2f4. 


Suppose an FM detector is a second-order PLL with loop gain K and H(f) = 1 + 
K/j2f. Let the input signal be modulated by x() = Am cos 27f,,t where A,, = 1 and 
0 =f, = W. (a) Use the relationship in Prob. 7.3-12b to show that the steady-state 
amplitude of e(r) is maximum when f, = K/ V2 if K/ V2 = W. (b) Now assume 
that K/ V2 > W and fa > W. Since linear operation requires |e(f)| < 0.5 rad, so 
sin € = e, show that the minimum loop gain is K ~ 2 V fAW. 


Consider the second-order PLL in Prob. 7.3-14. (a) Show that H,(f) becomes a 
second-order LPF with |H,| maximum at f = 0.556K and 3-dB bandwidth B = 
1.14K. (b) Use the loop-gain conditions in Probs. 7.3-13 and 7.3-14 to compare the 
minimum 3-dB bandwidths of a first-order and second-order PLL FM detector when 
fA/W = 2, 5, and 10. 


Explain the following statements: (a) A TV frame should have an odd number of 
lines. (b) The waveform that drives the scanning path should be a sawtooth, rather 
than a sinusoid or triangle. 


Consider a scanning raster with very small slope and retrace time. Sketch the video 
signal and its spectrum, without using Eq. (4), when the image consists of: (a) alter- 
nating black and white vertical bars of width H/4; (b) alternating black and white 
horizontal bars of height V/4. 
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Consider an image that's entirely black (J = 0) except for a centered white rectangle 
(I = 1.0) of width eH and height BV. (a) Show that |c,,| = œ| sinc am sinc Bn]. 
(b) Sketch the resulting line spectrum when a = 1/2, B = 1/4, and f, = f,/100. 


Calculate the number of pixels and the video bandwidth requirement for a low- 
resolution TV system with a square image, 230 active lines, and 100-ys active line 
time. 


Calculate the number of pixels and the video bandwidth requirement for the HDTV 
system in Table 7.4—1 if N,, <N and T,, = 0.2Tiine- 


Calculate the number of pixels and the video bandwidth requirement for the CCIR 
system in Table 7.4—1 if N,, = 48 and T,, = 10 ps. 


Horizontal aperture effect arises when the scanning process in a TV camera pro- 
duces the output. 


x(t) = | x(A) dÀ 


l= % 
where x(t) is the desired video signal and 7 < Tine. (a) Describe the resulting TV 
picture. (b) Find an equalizer that will improve the picture quality. 


Describe what happens to a color TV picture when: (a) the gain of the chrominance 
amplifier is too high or too low; (b) the phase adjustment of the color subcarrier is in 
error by +90 or 180°. 


Carry out the details leading from Eq. (15) to Eq. (16). 


Obtain expressions equivalent to Eqs. (15) and (16) when all the filters in the xp chan- 
nel (at transmitter and receiver) are the same as the x, channel. Discuss your results. 
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hapters 2 through 7 dealt entirely with deterministic signals, for when we write an explicit time function v(t} we 

presume that the behavior of the signal is known or determined for all time. In Chapter 9 we'll deal with random 
signals whose exact behavior cannot be described in advance. Random signals occur in communication both as 
unwanted noise and as desired information-bearing waveforms. Lacking detailed knowledge of the time variation of 
a random signal, we must speak instead in terms of probabilities and statistical properties. This chapter therefore pre- 
sents the groundwork for the description of random signals. 

The major topics include probabilities, random variables, statistical averages, and important probability models. 
We direct our coverage specifically toward those aspects used in later chapters and rely heavily on intuitive reason- 
ing rather than mathematical rigor. 

If you've previously studied probability and statistics, then you can skim over this chapter and go to Chapter 9. 
(However, be alert for possible differences of notation and emphasis.} If you want to pursue the subject in greater 
detail, you'll find a wealth of material in texts devoted to the subject. 


OBJECTIVES 


After studying this chapter and working the exercises, you should be able to do each of the following: 


l. Calculate event probabilities using frequency of occurrence and the relationships for mutually exclusive, joint, 
conditional, and statistically independent events (Sect. 8.1). 


2. Define and state the properties of the probability functions of discrete and continuous random variables (Sect. 8.2). 


Write an expression for the probability of a numerical-valued event, given a frequency function, CDF, or PDF 
(Sect. 8.2). 


Find the mean, mean-square, and variance of a random variable, given its frequency function or PDF (Sect. 8.3). 
Define and manipulate the expectation operation (Sect. 8.3). 

Describe applications of the binomial, Poisson, gaussian, and Rayleigh probability models (Sect. 8.4). 

Write probabilities for a gaussian random variable in terms of the Q function (Sect. 8.4). 
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8.1 PROBABILITY AND SAMPLE SPACE 


Probability theory establishes a mathematical framework for the study of random 
phenomena. The theory does not deal with the nature of random processes per se, 
but rather with their experimentally observable manifestations. Accordingly, we'll 
discuss probability here in terms of events associated with the outcomes of experi- 
ments. Then we'll introduce sample space to develop probability theory and to 
obtain the probabilities of various types of events. 


Probabilities and Events 


Consider an experiment involving some element of chance, so the outcome varies 
unpredictably from trial to trial. Tossing a coin is such an experiment, since a trial 
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toss could result in the coin landing heads up or tails up. Although we cannot predict 
the outcome of a single trial, we may be able to draw useful conclusions about the 
results of a large number of trials. 

For this purpose, let's identify a specific event A as something that might be 
observed on any trial of a chance experiment. We repeat the experiment N times and 
record N,, the number of times A occurs. The ratio N,/N then equals the relative fre- 
quency of occurrence of the event A for that sequence of trials. 

The experiment obeys the empirical law of large numbers if V,/N approaches 
a definite limit as N becomes very large and if every sequence of trials yields the 
same limiting value. Under these conditions we take the probability of A to be 


P(A) = N,/N N — oo [1] 


The functional notation P(A) emphasizes that the value of the probability depends 


upon the event in question. Nonetheless, every probability is a nonnegative number 
bounded by 


0=P(A) <1 


since 0 = N, = N for any event A. 

Our interpretation of probability as frequency of occurrence agrees with intu- 
ition and common experience in the following sense: You can't predict the specific 
result of a single trial of a chance experiment, but you expect that the number of 
times A occurs in Nœ 1 trials will be N, ^ NP(A). Probability therefore has mean- 
ing only in relation to a large number of trials. 

By the same token, Eq. (1) implies the need for an infinite number of trials to 
measure an exact probability value. Fortunately, many experiments of interest pos- 
sess inherent symmetry that allows us to deduce probabilities by logical reasoning, 
without resorting to actual experimentation. We feel certain, for instance, that an 
honest coin would come up heads half the time in a large number of trial tosses, so 
the probability of heads equals 1/2. 

Suppose, however, that you seek the probability of getting two heads in three 
tosses of an honest coin. Or perhaps you know that there were two heads in three 
tosses and you want the probability that the first two tosses match. Although such 
problems could be tackled using relative frequencies, formal probability theory pro- 
vides a more satisfactory mathematical approach, discussed next. 


sample Space and Probability Theory 


A typical experiment may have several possible outcomes, and there may be various 
ways of characterizing the associated events. To construct a systematic model of a 
chance experiment let the sample space S denote the set of outcomes, and let S be 


partitioned into sample points s}, S}, . . . , corresponding to the specific outcomes. 
Thus, in set notation, 


S = [sx] 
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Although the partitioning of S is not unique, the sample points are subject to two 
requirements: | 


1. Theset(s, 5, ... ) must be exhaustive, so that S consists of all possible out- 
comes of the experiment in question. 


2. The outcomes s,, 5, . . . must be mutually exclusive, so that one and only one 
of them occurs on a given trial. 


Consequently, any events of interest can be described by subsets of S containing 
Zero, one, or more than one sample points. 

By way of example, consider the experiment of tossing a coin three times and 
observing the sequence of heads (H) and tails (T). The sample space then contains 
2 X 2 X 2 — 8 distinct sequences, namely, 


5 = (HHH, HTH, HHT, THH, THT, TTH, HTT, TIT} 


where the order of the listing is unimportant. What is important is that the eight 
sample-point sequences are exhaustive and mutually exclusive. The event A = “two 
heads” can therefore be expressed as the subset 


A = {HTH, HHT, THH} 


Likewise, the events B = "second toss differs from the other two” and C = “first 
two tosses match” are expressed as - 


B = {HTH,THT} | C = {HHH, HHT, TTH, TIT} 


Figure 8.1—1 depicts the sample space and the relationships between A, B, and 
C in the form of a Venn diagram, with curves enclosing the sample points for each 
event. This diagram brings out the fact that B and C happen to be mutually exclusive 
events, having no common sample points, whereas A contains one point in common 
with B and another point in common with C. 








Figure 8.1-1 Sample space and Venn diagram of three events. 
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Other events may be described by particular combinations of event subsets, as 
follows: 
* The union event A + B (also symbolized by A U B) stands for the occurrence 
of A or B or both, so its subset consists of all s; in either A or B. 


The intersection event AB (also symbolized by A N B) stands for the occur- 
rence of A and B, so its subset consists only of those s; in both A and B. 


For instance, in Fig. 8.1-1 we see that 
A + B = {ATH, HHT, THH, THT } AB = {HTH} 
But since B and C are mutually exclusive and have no common sample points, 
BC = Ø 


where @ denotes the empty set. 


Probability theory starts with the assumption that a probability P(s,) has been 
assigned to each point s; in the sample space S for a given experiment. The theory 
says nothing about those probabilities except that they must be chosen to satisfy 
three fundamental axioms: 


P(A) = 0 for any event A in S [2a] 
P(S) =1 [25] 
P(A, + A5) = P(A;) + P(A5) if 4,4; = Ø [2d 


These axioms form the basis of probability theory, even though they make no men- 
tion of frequency of occurrence. Nonetheless, axiom (2a) clearly agrees with Eq. 
(1), and so does axiom (2b) because one of the outcomes in S must occur on every 
trial. To interpret axiom (2c) we note that if A, occurs N, times in N trials and A, 
occurs N, times, then the event “A, or A,” occurs N, + N, times since the stipulation 
A,A, = Ø means that they are mutually exclusive. Hence, as N becomes large, 
P(A; + Aj) = (N, + NN = (NUN) + (NN) = P(A) + P(A). 

Now suppose that we somehow know all the sample-point probabilities P(s;) for 
a particular experiment. We can then use the three axioms to obtain relationships for 
the probability of any event of interest. To this end, we'll next state several impor- 
tant general relations that stem from the axioms. The omitted derivations are exer- 
cises in elementary set theory, and the relations themselves are consistent with our 
interpretation of probability as relative frequency of occurrence. 


Axiom (2c) immediately generalizes for three or more mutually exclusive 
events. For if 


A,A,A3... =O 
then 


P(A, + Ap + Ag + ...) = P(A;) + P(A,) + P(A3) + ... [3] 
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Furthermore, if M mutually exclusive events have the exhaustive property 
A+A, + us. tAy=S 


then, from axioms (2c) and (2D), 
M 
P(A, + Ay + ^c + Ay) = >) P(A) = 1 [4] 
i=] 


Note also that Eq. (4) applies to the sample-point probabilities P(s;). 

Equation (4) takes on special importance when the M events happen to be 
equally likely, meaning that they have equal probabilities. The sum of the probabil- 
ities in this case reduces to M X P(A;) = 1, and hence 


P(A) =1/M  i=1,2,...,M [5] 


This result allows you to calculate probabilities when you can identify all possible 
outcomes of an experiment in terms of mutually exclusive, equally likely events. 
The hypothesis of equal likelihood might be based on experimental data or symme- 
try considerations—as in coin tossing and other honest games of chance. 

Sometimes we'll be concerned with the nonoccurrence of an event. The event 
“not A” is called the complement of A, symbolized by AC (also written A). The 
probability of AC is 


P(AS) = 1 — P(A) [6] 
since A + AT = S and AAC = Ø. 


Finally, consider events A and B that are not mutually exclusive, so axiom (2c) 
does not apply. The probability of the union event A + B is then given by 


P(A + B) = P(A) + P(B) — P(AB) [7] 


in which P(AB) is the probability of the intersection or joint event AB. We call P(AB) 
the joint probability and interpret it as 


P(AB) = N/N | N—oo 


where N,, stands for the number of times A and B occur together in N trials. Equa- 
tion (7) reduces to the form of axiom (2c) when AB = Ø, so A and B cannot occur 
together and P(AB) = 0. 


As an application of our probability relationships, we’ll calculate some event prob- 
abilities for the experiment of tossing an honest coin three times. Since H and T 
are equally likely to occur on each toss, the eight sample-point sequences back in 
Fig. 8.1-1 must also be equally likely. We therefore use Eq. (5) with M = 8 to get 
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The probabilities of the events A, B, and C are now calculated by noting that 
A contains three sample points, B contains two, and C contains four, so Eq. (3) 
yields 


PA) =iti+is} PG=} XOS: 


Similarly, the joint-event subsets AB and AC each contain just one sample point, 
so 


P(AB) = P(AC) = 1/8 


whereas P(BC) = 0 since B and C are mutually exclusive. 
The probability of the complementary event AC is found from Eq. (6) to be 


Pac) =1-3=§ 
The probability of the union event A + B is given by Eq. (7) as 
PA + B) =} +4 -4 


Our results for P(A°) and P(A + B) agree with the facts that the subset A^ contains 
five sample points and A + B contains four. 





EXERCISE 8.1-1 


A certain honest wheel of chance is divided into three equal segments colored green 
(G), red (R), and yellow (Y), respectively. You spin the wheel twice and take the out- 
come to be the resulting color sequence—GR, RG, and so forth. Let A = “neither 
color is yellow” and let B = “matching colors.” Draw the Venn diagram and calcu- 
late P(A), P(B), P(AB), and P(A + B). 








Conditional Probability and Statistical Independence 


Sometimes an event B depends in some way on another event A having P(A) + 0. 
Accordingly, the probability of B should be adjusted when you know that A has 
occurred. Mutually exclusive events are an extreme example of dependence, for if 
you know that A has occurred, then you can be sure that B did not occur on the same 
trial. Conditional probabilities are introduced here to account for event dependence 
and also to define statistical independence. 

We measure the dependence of B on A in terms of the conditional probability 


P(B|A) $ P(AB)/P(A) [8] 


The notation B|A stands for the event B given A, and P(B|A) represents the proba- 
bility of B conditioned by the knowledge that A has occurred. If the events happen to 
be mutually exclusive, then P(AB) = 0 and Eq. (8) confirms that P(B|A) = 0 as 
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expected. With P(AB) # 0, we interpret Eq. (8) in terms of relative frequency by 
inserting P(AB) = N,4/N and P(A) = N,/N as N > œ. Thus, 
Nial N _ Naa 


P(B|A) = NJN Ny, 





which says that P(B|A) equals the relative frequency of A and B together in the N, 
trials where A occurred with or without B. 

Interchanging B and A in Eq. (8) yields P(A|B) = P(AB)/P(B), and we thereby 
obtain two relations for the joint probability, namely, 


P(AB) = P(A|B)P(B) = P(BIA)P(A) [9] 
Or we could eliminate P(AB) to get Bayes’ theorem 
P(B)P(A|B 

P(B|A) = meee [10] 


This theorem plays an important role in statistical decision theory because it allows us 
to reverse the conditioning event. Another useful expression is the total probability 


P(B) = > POIA)PA) 1 


where the conditioning events Aj, Aj, . . . , Ay must be mutually exclusive and 
exhaustive. 

Events A and B are said to be statistically independent when they do not 
depend on each other, as indicated by 


P(B|A) = P(B) P(A|B) — P(A) [12] 
Inserting Eq. (12) into Eq. (9) then gives 
P(AB) = P(A)P(B) 


so the joint probability of statistically independent events equals the product of the 
individual event probabilities. Furthermore, if three or more events are all indepen- 
dent of each other, then 


P(ABC ...) = P(A)P(B)P(C)... [13] 


in addition to pairwise independence. 

As a rule of thumb, physical independence is a sufficient condition for statisti- 
cal independence. We may thus apply Eq. (12) to situations in which events have no 
physical connection. For instance, successive coin tosses are physically indepen- 
dent, and a sequence such as TTH may be viewed as a joint event. Invoking the 
equally likely argument for each toss alone, we have P(H) = P(T) = 1/2 and 
P(TTH) = P(T)P(TD)P(H) = (1/2) = 1/8—in agreement with our conclusion in 
Example 8.1-1 that P(s,) = 1/8 for any three-toss sequence. 
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In Example 8.1-1 we calculated the probabilities P(A) — 3/8, P(B) — 2/8, and EXAMPLE 8.1-2 


P(AB) — 1/8. We'll now use these values to investigate the dependence of events A 
and B. 

Since P(A)P(B) = 6/64 # P(AB), we immediately conclude that A and B are not 
statistically independent. The dependence is reflected in the conditional probabilities 


P(AB) " 1/8 zd P(AIB) = P(AB) X 1/8 i 
P(A) 3/8 3 P(B) 2/8 2 
so P(B|A) # P(B) and P(A|B) # P(A). 

Reexamination of Fig. 8.1-1 reveals why P(B|A) > P(B). Event A corresponds 
to any one of three equally likely outcomes, and one of those outcomes also corre- 
sponds to event B. Hence, B occurs with frequency N,,/N, = 1/3 of the N, trials in 


which A occurs—as contrasted with P(B) = N,/N = 2/8 for all N trials. Like rea- 
soning justifies the value of P(A|B). 





P(B|A) = 











The resistance R of a resistor drawn randomly from a large batch has five possible EXAMPLE 8.1-3 
values, all in the range 40-60 Q. Table 8.1—1 gives the specific values and their 
probabilities. 


Table 8.1-1 
R: 40 45 50 55 60 
P,(R): 0.1 0.2 0.4 0.2 0.1 


Let the event A be “R = 50 Q” so 
P(A) = P(R = 40 or R = 45 or R = 50) = P«(40) + Pe(45) + Pe(50) = 0.7 
Similarly, the event B = 45 Q = R < 55 Q has 
P(B) = P45) + R(50) + PR(55) = 0.8 
The events A and B are not independent since 
P(AB) = PR(45) + PR(50) = 0.6 
which does not equal the product P(A)P(B). Then, using Eqs. (7) and (9), 


0.6 0.6 
P(A + B) = 0.7 + 0.8 — 0.6 = 0.9 P(B|A) = 07^ 0.857 P(A|B) = OR 0.75 
The value of P(A + B) is easily confirmed from Table 8.1-1, but the conditional 
probabilities are most easily calculated from Eq. (9). 
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Referring to Fig. 8.1—1 let 
D = {THT, TTH, HTT, TIT] 


which expresses the event “two or three tails" Confirm that B and D are statistically 
independent by showing that P(B|D) = P(B), P(D|B) = P(D), and P(B)P(D) = 
P(BD). 


8.2 RANDOM VARIABLES AND PROBABILITY 
FUNCTIONS 


Coin tossing and other games of chance are natural and fascinating subjects for 
probability calculations. But communication engineers are more concerned with 
random processes that produce numerical outcomes—the instantaneous value of a 
noise voltage, the number of errors in a digital message, and so on. We handle such 
problems by defining an appropriate random variable, or RV for short. 

Despite the name, a random variable is neither random nor a variable. Instead, 


it's a function that generates numbers from the outcomes of a chance experiment. 
Specifically, 











Almost any relationship may serve as an RV, provided that X is real and single- 
valued and that 


P(X = —o0) = P(X = œ) = 0 


The essential property is that X maps the outcomes in S into numbers along the real 

line ~œ < x < ~, (More advanced presentations deal with complex numbers.) 
We'll distinguish between discrete and continuous RVs, and we'll develop 

probability functions for the analysis of numerical-valued random events. 


Discrete Random Variables and CDFs 


If S contains a countable number of sample points, then X will be a discrete RV hav- 
ing a countable number of distinct values. Figure 8.2-1 depicts the corresponding 
mapping processes and introduces tbe notation x, < x, <... for the values of X(s) 
in ascending order. Each outcome produces a single number, but two or more out- 
comes may map into the same number. 


nere, ia era at 
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Although a mapping relationship underlies every RV, we usually care only 
about the resulting numbers. We'll therefore adopt a more direct viewpoint and treat 
X itself as the general symbol for the experimental outcomes. This viewpoint allows 
us to deal with numerical-valued events such as X = a or X = a, where a is some 
point on the real line. Furthermore, if we replace the constant a with the independent 
variable x, then we get probability functions that help us calculate probabilities of 
numerical-valued events. 

The probability function P(X = x) is known as the cumulative distribution 
function (or CDF), symbolized by 


Fy (x) 2 P(X x x) [1] 


Pay careful attention to the notation here: The subscript X identifies the RV whose 
characteristics determine the function F(x), whereas the argument x defines the 
event X < x so x is not an RV. Since the CDF represents a probability, it must be 
bounded by 


0x F(x) =1 [2a] 

with extreme values 
Fy(—o00) = 0 Fy(c0) = 1 i2b] 
The lower limit reflects our condition that P(X = —o) = 0, whereas the upper limit 


says that X always falls somewhere along the real line. The complementary events 
X = x and X > x encompass the entire real line, so 


P(X > x) = 1 — F(x) [3] 


Other CDF properties will emerge as we go along. 


X(s) 


xy X3 Xk 


Figure 8.2-1] Sample points mapped by the discrete RY X[s] into numbers on the real line. 
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XSa a<X<b X>ob 
9 C—F= 


a b 


Figure 8.2-2 Numerical-valued events along the real line. 


Suppose we know F(x) and we want to find the probability of observing a < 
X = b. Figure 8.2-2 illustrates the relationship of this event to the events X < a and 
X > b. The figure also brings out the difference between open and closed inequali- 


ties for specifying numerical events. Clearly, the three events here are mutually 
exclusive when b > a, and 


P(X sa) + P(a«Xzb) -* PX» b)  P(Xzxcoo)-«1 
Substituting P(X = a) = F(a) and P(X > b) = 1 — Fy(b) yields the desired result 
Pia < X € b) = F(b) — F(a) ba [4] 


Besides being an important relationship in its own right, Eq. (4) shows that F(x) has 

the nondecreasing property Fy(b) = F,(a) for any b > a. Furthermore, Fy(x) is con- 

tinuous from the right in the sense that if e > 0 then Fy(@ + €)  Fy(x) as e  O. 
Now let's take account of the fact that a discrete RV is restricted to distinct val- 


ues xj, Xo, . ... This restriction means that the possible outcomes X = x; constitute a 
set of mutually exclusive events. The corresponding set of probabilities will be writ- 
ten as 
^ : 
P(x) = P(X = xj) feta Oe ee [5] 


which we call the frequency function. Since the x; are mutually exclusive, the prob- 
ability of the event X = x, equals the sum 


P(X = xy) = PyG) + Px(x2) + cc + Py) 


Thus, the CDF can be obtained from the frequency function P,(x,) via 


Fy(x,) = $ Py(xi) [6] 


This expression indicates that F,(x) looks like a staircase with upward steps of 
height Py(x;) at each x = x;. The staircase starts at F(x) = O for x < x, and reaches 


F(x) = 1 at the last step. Between steps, where x, < x < x,,,, the CDF remains con- 
stant at Fx(x,). 





EXAMPLE 8.2-1 Consider the experiment of transmitting a three-digit message over a noisy channel. 
The channel has error probability P(E) — 2/5 — 0.4 per digit, and errors are statisti- 
cally independent from digit to digit, so the probability of receiving a correct digit is 
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P(C) = 1 — 2/5 = 3/5 = 0.6. We'll take X to be the number of errors in a received 
message, and we'll find the corresponding frequency function and CDF. 

The sample space for this experiment consists of eight distinct error patterns, 
like the head-tail sequences back in Fig. 8.1—1. But now the sample points are not 
equally likely since the error-free pattern has P(CCC) = P(C)P(C)P(C) = (3/5 = 
0.216, whereas the all-error pattern has P(EEE) = (2/5)? = 0.064. Similarly, each of 
the three patterns with one error has probability (2/5) X (3/5)? and each of the three 
patterns with two errors has probability (2/5)? X (3/5). Furthermore, although there 
are eight points in S, the RV X has only four possible values, namely, x; = 0, 1, 2, 
and 3 errors. 

Figure 8.2-3a shows the sample space, the mapping for X, and the resulting 
values of Py(xj). The values of F,(x,) are then calculated via 


FeO) = P(0) F1) = Px(0) + Px(1) 


and so forth in accordance with Eq. (6). The frequency function and CDF are plotted 
in Fig. 8.2-3b. We see from the CDF plot that the probability of less than two errors 
is Fy(2 — €) = F,(1) = 81/125 = 0.648 and the probability of more than one error 
is 1 — Fy(1) = 44/125 = 0.352. 





Let a random variable be defined for the experiment in Exercise 8.1-1 (p. 317) by EXERCISE 8.2-1 
the following rule: The colors are assigned the numerical weights G = 2, R = —- 1, 
and Y = 0, and X is taken as the average of the weights observed on a given trial of 
two spins. For instance, the outcome RY maps into the value X(RY) = (—1 + 0/2 = 


— 0.5. Find and plot P,(x,) and Fy(x). Then calculate P(—1.0 < X = 1.0). 





Continuous Random Variables and PDFs 


A continuous RV may take on any value within a certain range of the real line, 
rather than being restricted to a countable number of distinct points. For instance, 
you might spin a pointer and measure the final angle 0. If you take X(0) = tan? 0, as 
shown in Fig. 8.2-4, then every value in the range 0 = x < œ% is a possible outcome 
of this experiment. Or you could take X(0) — cos 0, whose values fall in the range 
—10zx«z1.. 

Since a continuous RV has an uncountable number of possible values, the 
chance of observing X = a must be vanishingly small in the sense that P(X = a) = 0 
for any specific a. Consequently, frequency functions have no meaning for continu- 
ous RVs. However, events such as X = a and a < X = b may have nonzero proba- 
bilities, and F(x) still provides useful information. Indeed, the properties stated 
before in Eqs. (1)- (4) remain valid for the CDF of a continuous RV. 


P(x;) 
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Figure 8.2-3 (a) Mapping for Example 8.2-1. (b] Frequency function and CDF for the dis- 
crete RV in Example 8.2-1. 






X=tan2 6 


0 


324 Figure 8.2-4 Mapping by a continuous RV. 
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But a more common description of a continuous RV is its probability density 
function (or PDF), defined by 


px(x) Ê dF (x)/dx [7] 


provided that the derivative exists. We don't lose information when differentiating 
F(x) because we know that Fy4(—co) = 0. We can therefore write 


P(X = x) = F(x) = | p, (4) da [8] 


~ 


where we've used the dummy integration variable A for clarity. Other important 
PDF properties are 


px(x) = 0 MS dx =1 [9] 
b 
P(a < X = b) = F,(b) — F(a) = | px(x) dx [10] 


a 





As a special case of Eq. (10), let a = x — dx and b = x. The integral then 
reduces to the differential area p,(x) dx and we see that 


px(x) dx = P(x — dx < X < x) SERI 


This relation serves as another interpretation of the PDF, empnasizing its nature as a 
probability density. Figure 8.2-5 shows a typical PDF for a continuous RV and the 
areas involved in Eqs. (10) and (11). 


p.) 


P(x-dx«X&x) 


P(a« X&b) 








jaa = a b 


Figure 8.2-5 A typical PDF and the area interpretation of probabilities. 
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Occasionally we'll encounter mixed random variables having both continuous 
and discrete values. We treat such cases using impulses in the PDF, similar to our 
spectrum of a signal containing both nonperiodic and periodic components. Specifi- 
cally, for any discrete value x) with nonzero probability Py(x)) = P(X = x) # 0, the 
PDF must include an impulsive term Py(x;)6(x — xo) so that F’,(x) has an appropriate 
jump at x — x,. Taking this approach to the extreme, the frequency function of a dis- 
crete RV can be converted into a PDF consisting entirely of impulses. 

But when a PDF includes impulses, you need to be particularly careful with 
events specified by open and closed inequalities. For if py(x) has an impulse at xo, 
then the probability that X = x; should be written out as P(X = xy) = P(X > xy) + 
P(X = x). In contrast, there's no difference between P(X = xp) and P(X > xj) for a 
strictly continuous RV having P(X = xy) = 0. 


Uniform PDF 


To illustrate some of the concepts of a continuous RV, let's take X = 0 (radians) for 
the angle of the pointer back in Fig. 8.2—4. Presumably all angles between 0 and 27r 
are equally likely, so p(x) has some constant value C for 0 < x = 2r and p,(x) = 0 
outside this range. We then say that X has a uniform PDF. 

The unit-area property requires that 


co 2m 
[ py) ar = | Cdx=15>C=1/20 


—oo0 0 
so 


1/27 0€xzx2mv 
0 otherwise 


1 
PX) = 5 Lue) — «(x = 22)] = l 
which is plotted in Fig. 8.2-6a. Integrating p,(x) per Eq. (8) yields the CDF in 
Fig. 8.2-6b5, where 
FX(x)—x/2m | O0€x-z2m 


so, for example, P(X = m) = Fy(7) = 1/2. These functions describe a continuous 
RV uniformly distributed over the range 0 < x S 277. 
But we might also define another random variable Z such that 


ge XST 
X X>T7 


Then P(Z < 7) = 0, P(Z = m) = P(X S r) = 1/2, and P(Z = z) = PX = z) for 
z > m. Hence, using z as the independent variable for the real line, the PDF of Z is 


1 1 
Pelz) = 59 — m) + =~ [ule ~ 7) ~ ulz ~ 27)] 


The impulse here accounts for the discrete value Z = m. 
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Figure 8.2-6 PDF and CDF of a uniformly distributed RV. 





Use the PDFs in Example 8.2-2 to calculate the probabilities of the following EXERCISE 8.2-2 
events: (a) m < X x 37/2, (b) X > 32/2, (c) m < Z < 37/2, and (d) m E ZS 37/2. 





Transformations of Random Variables 


The preceding example touched upon a transformation that defines one RV in terms 
of another. Here, we'll develop a general expression for the resulting PDF when the 
new RV is a continuous function of a continuous RV with a known PDF. 


Suppose we know py(x) and we want to find pz(z) for the RV related to X by the 
transformation function 


Z = g(X) 
We initially assume that g(X) increases monotonically, so the probability of observ- 
ing Z in the differential range z — dz < Z = z equals the probability that X occurs in 
the corresponding range x — dx < X = x, as illustrated in Fig. 8.2-7. Equation (10) 
then yields p(z) dz = py(x) dx, from which p4(z) = py(x) dx/dz. But if g(X) 
decreases monotonically, then Z increases when X decreases and pz(z) = 
Dx(x)(— dx/dz). We combine both of these cases by writing 


dz 


Finally, since x transforms to z — g(x), we insert the inverse transformation x — 
g }(z) to obtain 


pz(z) = px(x) 








T A TEE m rn t ATH etn n emen i neni eem 


dg (z) 


Pz(z) = Pxle ^ (z)] 72 


[121 








| which holds for any monotonic function. 
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Z-g(X 





Figure 8.2-7 Transformation of an RV. 


A simple but important monotonic function is the linear transformation 
Z=axX+B [13a] 


where o and f are constants. Noting that z = g(x) = ax + B, x = g (z) = 
(z — Ba, and dg" (z)dz = l/a, Eq. (12) becomes 
1 zB 
px) = Ed [135] 


|j" a 


Hence, pz(z) has the same shape as py(x) shifted by (9 and expanded or compressed 
by a. 

If g(X) is not monotonic, then two or more values of X produce the same value 
of Z. We handle such cases by subdividing g(x) into a set of monotonic functions, 





gi (X). g.(x),..., defined over different ranges of x. Since these ranges correspond to 
mutually exclusive events involving X, Eq. (12) generalizes as 
ars (dgr (2) aler o 
P2(z) = Pxl8i (z)] dz + px[22!(2)] ae qe [14] 








The following example illustrates this method. 


Consider the transformation Z = cos X with X being the uniformly distributed angle 
from Example 8.2-2. The plot of Z versus X in Fig. 8.2-8a brings out the fact that Z 


goes twice over the range —1 to 1 as X goes from 0 to 27, so the transformation is 
not monotonic. 


To calculate pz(z), we first subdivide g(x) into the two monotonic functions 
gi(x) = cos x 0cxzq 
g(x) = cos x m IxT 


which happen to be identical except for the defining ranges. For the range 0 < x S 7, 
we have py(x) = 1/27 with x = g7!(z) = cos^! z, so dg; (z/dz = —(1 — z?) ?? and 


i ' 
esaea | 
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Equation (15) corresponds to the volume between the x-y plane and the surface 
PxY (xy) for the stated ranges of x and y. 

If X and Y happen to be statistically independent, then their "T PDF reduces 
to the product 


Dxy(% y) = Px(x)py(y) [16] 
Otherwise, the dependence of Y on X is expressed by the conditional PDF 
Pxv(% y) 
prls) = [17] 
px 


which corresponds to the PDF of Y given that X — x. 

The PDF for X alone may be obtained from the joint PDF by noting that P(X = x) 
= P(X Sx, ~% < Y < oo) since the value of Y doesn't matter when we're only con- 
cerned with X. Thus, 


px(x) = | Py Y) dy [18] 


-—00 


We call p,(x) a marginal PDF when it's derived from a joint PDF per Eq. (18). 





The joint PDF of two noise voltages is known to be 


1 2 
Pxy(x. y) co ener Nee —oo <x < œ, ~% < y < oo 
TT 
From Eq. (18), the marginal PDF for X alone is 


N 1 2 2 1 2 es 2 1 2 
px(x) = | — e70 at? ay = Lenf e™ dÀ = gre 
o 





24r 


where we have made the change of variable A = y — x/2. In like manner, 





EN 2) l / 

ut —(y? ~ xy +x7/2 = —y?/2 
pry) -| gom dx = e 
Leo 2T 2T 


Thus, X and Y are not independent since py (prO) # pyxy(x y). But Eq. (17) yields 

the conditional PDFs 

1 -(y-9 +2) l -G-9 +4) 

= e v px(xly) = ae ta 
Vr : Var 


T 














8.3 STATISTICAL AVERAGES 


For some purposes, a probability function provides more information about an RV 
than actually needed. Indeed, the complete description of an RV may prove to be an 
embarrassment of riches, more confusing than illuminating. Thus, we often find it 
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more convenient to describe an RV by a few characteristic numbers. These numbers 
are the various statistical averages presented here. 


Means, Moments, and Expectation 


The mean of the random variable X is a constant m, that equals the sum of the val- 
ues of X weighted by their probabilities. This statistical average corresponds to an 
ordinary experimental average in the sense that the sum of the values observed over 
N > ] trials is expected to be about Nmy. For that reason, we also call m, the 
expected value of X, and we write E[X] or X to stand for the expectation operation 
that yields my. 

To formulate an expression for the statistical average or expectation, we begin 
by considering N independent observations of a discrete RV. If the event X = x; 
occurs N, times, then the sum of the observed values is 


Nux, si Na Xag- = Y Nx, 


Upon dividing by N and letting N — ©, the relative frequency N/N becomes P(X = xj) 
= P(x). Thus, the statistical average value is 


which expresses the mean of a discrete RV in terms of its frequency function P,(x,). 
For the mean of a continuous RV, we replace Py(x) with P(x — dx < X < x) = 
px (X) dx and pass from summation to integration so that 


m= | spe) ds T 


This expression actually includes Eq. (1) as a special case obtained by writing the 
discrete PDF as 


px(x) = pe Py (xi)&(x — xi) 


Hence, when we allow impulses in the PDF, Eq. (2) applies to any RV—continuous, 
discrete, or mixed. Hereafter, then, statistical averages will be written mostly in inte- 
gral form with PDFs. The corresponding expressions for a strictly discrete RV are 
readily obtained by substituting the frequency function in place of the PDF or, more 
directly, by replacing the integration with the analogous summation. 

When a function g(X) transforms X into another random variable Z, its 
expected value can be found from p,(x) by noting that the event X = x transforms 
to Z = g(x), so 


Elec] = | stop d " 


—oo 
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If g(X) — X", then E[X"] is known as the nth moment of X. The first moment, of 
course, is just the mean value E[X] — my. The second moment E[X?] or X ? is called 
the mean-square value, as distinguished from the mean squared my, = X?. Writing 
out Eq. (3) with g(X) = X?, we have 


d oo 
xl x pex) dx 


—00 


or, for a discrete RV, 


X = £d Py(x;) 


The mean-square value will be particularly significant when we get to random sig- 
nals and noise. 


Like time averaging, the expectation in Eq. (3) is a linear operation. Thus, if a 
and B are constants and if g(X) = aX + B, then 


E(oX + B] =aX+B [4] 
Although this result seems rather trivial, it leads to the not-so-obvious relation 
E[XX| = XE[X] = X" 


since X is a constant inside E[ XX]. 


Standard Deviation and Chebyshev’s Inequality 
The standard deviation of X, denoted by oy, provides a measure of the spread of 
observed values of X relative to my. The square of the standard deviation is called the 
variance, or second central moment, defined by 

of Ê ERX — my)’] (5) 


But a more convenient expression for the standard deviation emerges when we 
expand (X — my)? and invoke Eq. (4), so 


E((X — mx), = E[X? — 2mxX + my] = Yi. 2m, X +m = X ~ ml 


and 


oy = V X? -— m [6] 


Hence, the standard deviation equals the square root of the mean-square value 
minus the mean value squared. 


For an interpretation of oy, let k be any positive number and consider the event 
|X — m,| = ko y. Chebyshev's inequality (also spelled Tchebycheff) states that 


P(\X — my| = kox) S 1/k? [7a] 





1 
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regardless of px(x). Thus, the probability of observing any RV outside +k standard 
deviations of its mean is no larger than 1/k?. By the same token, 


P(IX — my| < key) > 1 — 1/k? [75] 


With k = 2, for instance, we expect X to occur within the range my + 2a, for more 
than 3/4 of the observations. A small standard deviation therefore implies a small 
spread of likely values, and vice versa. 

The proof of Eq. (7a) starts by taking Z = X — my and a = ka, > 0. We then let 
€ be a small positive quantity and note that 


sz = | epl) de = | Ppa) de + | z’ pz(z) dz 
"PS -eo a-é€ 
But z = a? over the range a < |z| < oo, so 
=e as 
E[Z*] = a [ P2(z) dz + | pz(z) ae 


where the first integral inside the brackets represents P(Z = —a) whereas the second 
represents P(Z = a). Therefore, 


P(|Z| = a) = P(Z = —a) + P(Z = a) = E[Z?]/a? 
and Eq. (7a) follows by inserting Z = X — my, E[Z] = ai, anda = kay. 


To illustrate the calculation of statistical averages, let's take the case where 
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~alx| 


a 
px) — 5e -%0 <x <% 


with a being a positive constant. This PDF describes a continuous RV with a 
Laplace distribution. . 


Drawing upon the even symmetry of py(x), Eqs. (2) and (3) yield 


my = | x tel dy = 0 E[X?] = al x2 Eeu dy = 
2 2 
Teo 0 
Hence, from Eq. (6), oy = V E[X?] — m} = V2/a. 
The probability that an observed value of a Laplacian RV falls within +20 x of 
the mean is given by 
wala a 
P(|x — 0| < 2V2/a) = | = el dx = 0.94 
-2V2/a 2 


as compared with the lower bound of 0.75 from Chebyshev’s inequality. 
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Let X have a uniform distribution over 0 < X S 27, as in Example 8.2-2 (p. 326). 
Calculate my, X’, and oy. What's the probability of |X — my| < 20x? 





Multivariate Expectations 


Multivariate expectations involve two or more RVs, and they are calculated using 
multiple integration over joint PDFs. Specifically, when g(X,Y) defines a function of 
X and Y, its expected value is 


2 ll EE T 


—0o0 


E|s(X, Y) 


Ld 


However, we'll restrict our attention to those cases in which the multiple integration 
reduces to separate integrals. 

First, suppose that X and Y are independent, so pyy(x,y) = pxCOpy(y). Assume 
further that we can write g(X,Y) as a product in the form g(X,Y) = gx(X)gy(Y). 
Equation (8) thereby becomes 


eco 


E[g(X, Y)] = | nonna " 


= | j e, G)p,Q) af 8,0)p,0) dy 


~o —oo 


= E[gx(X) ]E(eY(Y)] 
If we take g(X; Y) = XY, for instance, then g,(X) = X and g,(Y) = Y so E[XY] = 
E[X]E[Y] or 


Hence, the mean of the product of independent RVs equals the product of their 
means. 


Next, consider the sum g(X, Y) = X + Y, where X and Y are not necessarily inde- 
pendent. Routine manipulation of Eq. (8) now leads to E[X + Y] = E[X] + E[Y] or 


X+Y=X+Y=myt+ my [11] 


Hence, the mean of the sum equals the sum of the means, irrespective of statistical 
independence. 


Finally, let Z = X + Y so we know that mz = my + my. But what's the variance 
c2? To answer that question we calculate the mean-square value E[Z?] via 


Z? = E[X? + 2XY + Y'] =X? -2XY + Y? 
Thus, 


a} = Z? — (my + my)? = (X? — my) + (Y? — mj) + 2(XY — mmy) 


8.3 Statistical Averages 


The last term of this result vanishes when Eq. (10) holds, so the variance of the sum 
of independent RVs is 


o2=0%4+03 [12] 


Equations (9)-(12) readily generalize to include three or more RVs. Keep in 
mind, however, that only Eq. (11) remains valid when the RVs are not independent. 





Let X,, X4, . . . , Xy be sample values obtained from N independent observations of a 
random variable X having mean my and variance o%. Each sample value is an RV, 
and so is the sum 


Z=X +X P uu dx 
and the sample mean 
pc Z/N 


We'll investigate the statistical properties of u, and we'll use them to reexamine the 
meaning of probability. Bay net, » 
From Eqs. (11) and (12) we have Z = X, + X; + © + Xy = Nmy and ox = 
No%. This, jj = Z/N = my, whereas 
1 a ES 


o} = El(u ~ AY] = v; EIZ ~ ZY] 


— 5 g E I Oy 

Ne N* 
Since o, = o'x/ VN, the spread of the sample mean decreases with increasing N, 
and u approaches my as N — ©. Furthermore, from Chebyshev's inequality, the 


probability that u differs from my by more than some positive amount e is upper- 
bounded by 





P( | — mj = e) = o$/Ne 


Although not immediately obvious, this result provides further justification for the 
relative-frequency interpretation of probability. 

To develop that point, let A be a chance-experiment event and let X be a discrete 
RV defined such that X = 1 when A occurs on a given trial and X = 0 when A does 
not occur. If A occurs N, times in N independent trials, then Z = N, and u = NN. 
Thus, our definition of X makes the sample mean u equal to the frequency of occur- 
rence of A for that set of trials. Furthermore, since P(A) = P(X = 1) = P,(1) and 
P(A‘) = P4(0), the statistical averages of X are 


my = 0 X P,(0) + 1 xX P,(1) = P(A) 
E[X?] = 0? x P4(0) + 1? x P,(1) = P(A) 
oy = P(A) — P*(A) 
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SO 
— Pp? 
P(|u — my| z e) = P| » gs P4) = 1 1 (A) 





We therefore conclude that, as N -> ©, N,/N must approach P(A) in the sense that 
the probability of a significant difference between N,/N and P(A) becomes negligi- 
bly small. l 





EXERCISE 8.3-2 Prove Eq. (11) using marginal PDFs as defined by Eq. (18), Sect. 8.2 (p. 330), for 
Px(x) and py). 





Characteristic Functions x% 


Having found the mean and variance of a sum Z = X + Y, we'll now investigate the 
PDF of Z and its relation to py(x) and py(y) when X and Y are independent. This 
investigation is appropriate bere because the best approach is an indirect one using a 
special type of expectation. 

The characteristic function of an RV X is an expectation involving an auxil- 
iary variable v defined by 


y(r) Ê g[e?X] = | el” p(x) dx [13] 
Upon closer inspection of Eq. (13), the presence of the complex exponential e/"* 
suggests similarity to a Fourier integral. We bring this out explicitly by letüng v — 
2Tt and x = f so that 


aon) = |. ne" a= Fa] (1a 
Consequently, by the Fourier integral theorem, 
px(f) = 9[O4(21)] = | De(2rt)e 77" dt [14b] 


Hence, the characteristic function and PDF of a random variable constitute the 
Fourier-transform pair ®,(27t) © p,Cf). 

Now, for the PDF of a sum of independent RVs, we let Z = X + Y and we use 
Eqs. (13) and (9) to write 


y(r) = Ele”) = Ele e] = E[ePX]E(e^*] = Dy) Ov) 
Then, from the convolution theorem, 


pÁf) = FL Dy(Qrt)Py(2at) = px f) * pf) 
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Appropriate change of variables yields the final result 


co 


p2(z) = | . Px — A)py(A) da = | Py(A)p E — A) dà [15] 


Thus, the PDF of X + Y equals the convolution of py(x) and p(y) when X and Y are 
independent. 


Other applications of characteristic functions are explored in problems at the 
end of the chapter. 





Use Eq. (14a) to find ®,(v) for the uniform PDF p,(x) = a^! IIa). EXERCISE 8.3-3 


8.4 PROBABILITY MODELS 


Many probability functions have been devised and studied as models for various 
random phenomena. Here we discuss the properties of two discrete functions (bino- 
mial and Poisson) and two continuous functions (gaussian and Rayleigh). These 
models, together with the uniform and Laplace distributions, cover most of the cases 
encountered in our latter work. Table T.5 at the back of the book summarizes our 
results and includes a few other probability functions for reference purposes. 


Binomial Distribution 


The binomial model describes an integer-valued discrete RV associated with 
repeated trials. Specifically, 





This model thus applies to repeated coin tossing when J stands for the number of 
heads in n tosses and P(H) = a. But, more significantly for us, it also applies to dig- 
ital transmission when J stands for the number of errors in an n-digit message with 
per-digit error probability a. 

To formulate the binomial frequency function P(()) = PU = i), consider any 
sequence of n independent trials in which event A occurs i times. If P(A) = a, then 
P(A‘) = 1 — wand the sequence probability equals o/(1 — o)" *. The number of dif- 
ferent sequences with i occurrences is given by the binomial coefficient, denoted 

j^ SO we have 
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Pi) = ("aa ay i=0O0,l...,n [1] 


These functions were previously evaluated in Fig. 8.2-3 for the case of n = 3 and 
a = 2/5, 


The binomial coefficient in Eq. (1) equals the coefficient of the (i + 1) term in 
the expansion of (a + b)", defined in general by the factorial expression 


n ^ n! 
() — i(n — i)! 2 


where it's understood that 0! = 1 when i = O or i = n. This quantity has the sym- 


metry property 
n n 
() 7 ? es ) 
We thus see that 


«Qe Q«Gue Qe) 


and so on. Other values can be found using Pascal's triangle, tables, or a calculator 
or computer with provision for factorials. 

The statistical averages of a binomial RV are obtained by inserting Eq. (1) into 
the appropriate discrete expectation formulas. Some rather laborious algebra then 
yields simple results for the mean and variance, namely, 


m = na o = no(1 — a) = m(1 — a) [3] 


where we've omitted the subscript J for simplicity. The relative spread o/m decreases 
as 1/ Vn, meaning that the likely values of J cluster around m when n is large. 


Suppose 10,000 digits are transmitted over a noisy channel with per-digit error 


probability a = 0.01. Equation (3) then gives 


m = 10,000 x 0.01 = 100 c? = 100(1 — 0.01) = 99 


Hence, the likely range m + 2c tells us to expect about 80 to 120 errors. 





Poisson Distribution 


The Poisson model describes another integer-valued RV associated with repeated 
trials, in that 


Ll 
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The resulting Poisson frequency function is 


(uTY 
—uT 
dide: 


Pu) 


[4] 
from which 


m= pr oF =m 


These expressions describe random phenomena such as radioactive decay and shot 
noise in electronic devices. 
The Poisson model also approximates the binomial model when n is very large, 
« is very small, and the product næ remains finite. Equation (1) becomes awkward 
to handle in this case, but we can let uT = m in Eq. (4) to obtain the more conven- 
ient approximation 
i 


m 


P,(i) mE e" E [5] 


Neither n nor o appears here since they have been absorbed in the mean value m = na. 





Use Eq. (5) to estimate the probability of J 2 errors when 10,000 digits are trans- EXERCISE 8.4-1 
mitted over a noisy channel having error probability a = 5 X 1075. 





Gaussian PDF 


The gaussian model describes a continuous RV having the normal distribution 
encountered in many different applications in engineering, physics, and statistics. 
The remarkable versatility of this model stems from the famous central limit theo- 
rem, which states in essence that 







x epi seni the so sum e N fadapendenti random Sapni ad if: each component nidkes only ac 
ll-éontribution to. the’sum; then. the:CDF of X approaches, a gaussian CDF as N becomes large, 
regardless of the distribution: of the individual components. 





A more complete statement of the theorem and its proof involves sophisticated 
mathematical arguments and won't be pursued here. However, we'll draw upon the 
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implication that the gaussian model often holds when the quantity of interest results 
from the summation of many small fluctuating components. Thus, for instance, ran- 
dom measurement errors usually cause experimental values to have a gaussian dis- 
tribution around the true value. Similarly, the random motion of thermally agitated 
electrons produces the gaussian random variable known as thermal noise. 


A gaussian RV is a continuous random variable X with mean m, variance c?, 
and PDF 


Px(x) = wae —oo < x < oco [6] 
2mo? 

This function defines the bell-shaped curve plotted in Fig. 8.4-1. The even symme- 
try about the peak at x — m indicates that 


P(X = m) = P(X > m) =} 


so observed values of X are just as likely to fall above or below the mean. 

Now assume that you know the mean m and variance g? of a gaussian RV and 
you want to find the probability of the event X > m + ko. Since the integral in ques- 
tion cannot be evaluated in closed form, numerical methods have been used to gen- 
erate extensive tables of the normalized integral 


1 = -Xf2 
Q(k) £ —— | g ^ gà [7] 
(k) 5x 


The change of variable A = (x — m)/o then shows that 

P(X > m + ko) = Q(k) 
We therefore call Q(k) the area under the gaussian tail, as illustrated by Fig. 8.4-2. 
This figure also brings out the fact that P(n < X = m+ ko) = $ — Q(k), which 
follows from the symmetry and unit-area properties of py(x). 


You can calculate any desired gaussian probability in terms of Q(k) using 
Fig. 8.42 and the symmetry of the PDF. In particular, 


Py(x) 








Figure 8.4-1 Gaussian PDF. 
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Px) 





Figure 8.4-2 Area interpretation of Q(k). 


P(X > m + ko) = P(X € m — ko) = Qk) [8a] 
P(m < X m + ko) = P(n — ko < X = m) =} — Ok) [8b] 
P(|X — m| > ko) = 20(k) [Bd 
P(\X — m| € ko) = 1 — 20(k) [Bd] 


Table 8.4—1 compares some values of this last quantity with the lower bound 1 — I/k? 
from Chebyshev’s inequality. The lower bound is clearly conservative, and the likely 
range of observed values is somewhat less than m + 2c. We usually take the likely 
range of a gaussian RV to be m + o since P(|X — m| = o) ~ 0.68. 


Table 8.4-1 

k 1 — 20(k) 1- Ee 
0.5 0.38 

1.0 0.68 0.00 
1.5 0.87 0.56 
2.0 0.95 0.75 
25 0.99 0.84 





For larger values of k, the area under the gaussian tail becomes too small for 
numerical integration. But we can then use the analytical approximation 


l 
V Dark? 


This approximation follows from Eq. (7) by integration by parts. 
Table T.6 at the back of the book gives a detailed plot of Q(k) for 0 =k « 7. 


Also given are relationships between Q(k) and other gaussian probability functions 
found in the literature. 





Q(k) = gom px [9] 
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EXAMPLE 8.4-2 Suppose you want to evaluate the probability of |X — m| = 3 when X is gaussian 
with a? = 4. You can use Eq. (8d) and Table 8.4—] by noting that o = 2 and ke: = 3 
when k = 3/0 = 1.5. Hence, 


P(X — m| = 3) = P(|X — m| = 150) = 1 - 20(1.5) = 0.87 





Given that X is gaussian with m = 5 and o = 8, sketch the PDF and mark the bound- 
aries of the area in question to show that P(9 < X = 25) = Q(0.5) — Q(2.5) = 0.3. 


EXERCISE 8.4-2 





dv 





Rayleigh PDF 


The Rayleigh model describes a continuous RV obtained from two gaussian RVs as 


follows: 


dpihac iaat a nee Liiris URE cet 





Thus, as shown in Fig. 8.4-3, the Rayleigh model applies to any rectangular-to- 
polar conversion when the rectangular coordinates are identical but independent 
gaussians with zero mean. 


To derive the corresponding Rayleigh PDF, we introduce the random angle «b 
from Fig. 8.4—3 and start with the joint PDF relationship 











Pao (r9) |dr de| = pxy(x, y)|dx dy| 
where 
y2 = x2 +y o = arctan (y/x) dx dy = r dr do 
y 
Figure 8.4-3 Rectangular to polar conversion. 
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Since X and Y are independent gaussians with m = 0 and variance o”, 





loea? 
^? 2 


LTO 


P(x, y) = px(x)p:(y) = 


Hence, including u(r) to reflect the fact that r = 0, we have 





r saajana 
Pror, 9) = ze m u(r) [10] 


270 


The angle ¢ does not appear explicitly here, but its range is clearly limited to 27 
radians. 


We now obtain the PDF for R alone by integrating Eq. (10) with respect to o. 
Taking either 0 < o = 27 or —7 < q x r yields the Rayleigh PDF 
r 2 2 
pir) = —e "P? u(r) [n] 
c 
which is plotted in Fig. 8.4—4. The resulting mean and second moment of R are 
R= Aj = c R = 20? [12] 
For probability calculations, the Rayleigh CDF takes the simple form 


Fr) = P(Rxr)-— t = eat [13] 
derived by integrating p&(A) over 0 x A x r. 
Returning to Eq. (10), we get the marginal PDF for the random angle ® via 


* 1 
Pole) = | Pro(r, €) dr = e 


so ® has a uniform distribution over 27 radians. We also note that pae(r,p) = 
Pr()pe(¢), which means that the polar coordinates R and € are statistically 


independent. These results will be of use in Chapter 10 for the representation of 
bandpass noise. 


P(r) 





Figure 8.4-4 Rayleigh PDF. 





344 CHAPTER8 € Probability and Random Variables 











EXAMPLE 8.4-3 Suppose you throw darts at a target whose bull's eye has a radius of 3 cm. If the rect- a 
angular coordinates of the impact points are gaussianly distributed about the origin E 

with spread o = 4 cm in either direction, then your probability of hitting the bull’s 3 

eye is given by Eq. (13) as 3 

P(R <3) = 1 — e? ~ 25% E 

EXERCISE 8.4-3 Derive Eq. (13) from Eq. (11). 1 


Bivariate Gaussian Distribution * 


Lastly, we want to investigate the joint PDF of two gaussian RVs that are neither 
identically distributed nor independent. As preparation, we first introduce a general 
measure of interdependence between any two random variables. 

Let X and Y be arbitrarily distributed with respective means and variances my, 
my, oy, and c2. The degree of dependence between them is expressed by the corre- 
lation coefficient 


EDS Eee e 


dt Breq fe [14] 


j OxOy 


where the expectation E[(X — my)(Y — my)] is called the covariance of X and Y. At 
one extreme, if X and Y are statistically independent, then E[((X — mY — mp] = 
EX — myj)]E(Y — my)] = 0 so the covariance equals zero and p = 0. At the other 
extreme, if Y depends entirely upon X in the sense that Y = +aX, then c? = (ao yy 
and the covariance equals tao? so p = +1. Thus, the correlation coefficient ranges 
over —1 = p = 1, and |p| reflects the degree of interdependence. 

. When X and Y are interdependent gaussian RVs, their joint PDF is given by the 
bivariate gaussian model 


1 


2mOxocy Vl—p 


e fe -2) [15a] 


Pxy (X. y= 3 


with 


2 


(x — my)? (y — my) = (x — mx)(y — my)p 


15b 
202 20$ Ox Oy iu 


foe») = 
This formidable-looking expression corresponds to a bell-shaped surface over the 
x-y plane, the peak being at x = m, and y = my. 
If p = 0, then the last term of Eq. (155) disappears and pyy(xy) = px(x)pyQ). 
We thus conclude that 


*  Uncorrelated gaussian RVs are statistically independent. 


so eat 
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Further study leads to the additional conclusions that 


The marginal and conditional PDFs derived from the bivariate gaussian PDF 


with any p are all gaussian functions. 


* Any linear combination Z = œX + BY will also be a gaussian RV. 


These three properties are unique characteristics of the bivariate gaussian case, and 
they do not necessarily hold for other distributions. 

All of the foregoing analysis can be extended to the case of three or more jointly 
gaussian RVs. In such cases, matrix notation is usually employed for compact PDF 


expressions. 


8.1-1* 


8.1-2 


8.1-3 


8.1-4 
8.1-5* 
8.1-6 


8.1-7 


8.1-8 


8.5 PROBLEMS 


The outcome of an experiment is an integer J whose value is equally likely to be any 
integer in the range 1 = J < 12. Let A be the event that J is odd, let B be the event 
that J is exactly divisible by 3, and let C be the event that J is exactly divisible by 4. 


Draw the Venn diagram and find the probabilities of the events A, B, C, AB, AC, BC, 
and ACB. 


The outcome of an experiment is an integer J whose value is equally likely to be any 
integer in the range 1 = / = 4. The experiment is performed twice, yielding the out- 
comes J, and J. Let A be the event that J, = L, let B be the event that J, > L, and let 
C be the event that 7, + I, = 6. Draw the Venn diagram and find the probabilities of 
the events A, B, C, AB, AC, BC, and ASB. 


If A and B are not mutually exclusive events, then the number of times A occurs in N 
trials can be written as N, = Nag + Nagc, where Nge stands for the number of times 
A occurs without B. Use this notation to show that P(AB©) = P(A) — P(AB). 


Use the notation in Prob. 8.1—3 to justify Eq. (7), (p. 316). 


Let C stand for the event that either A or B occurs but not both. Use the notation in 
Prob. 8.1-3 to express P(C) in terms of P(A), P(B), and P(AB). 


A biased coin is loaded such that P(H) = (1 + ©)/2 with 0 < |e| < 1. Show that the 
probability of a match in two independent tosses will be greater than 1/2. 


A certain computer becomes inoperable if two components C, and C, both fail. The 
probability that C, fails is 0.01 and the probability that C, fails is 0.005. However, 
the probability that C, fails is increased by a factor of 4 if C, has failed. Calculate 
the probability that the computer becomes inoperable. Also find the probability that 
C, fails if C; has failed. 


An honest coin 1s tossed twice and you are given partial information about the out- 
come. (a) Use Eq. (8) (p. 317) to find the probability of a match when you are told 
that the first toss came up heads. (b) Use Eq. (8) (p. 317) to find the probability of 
a match when you are told that heads came up on at least one toss. (c) Use Eq. (10) 
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(p. 318) to find the probability of heads on at least one toss when you are told that a 
match has occurred. 


8.1-9 Do Prob. 8.1-8 for a loaded coin having P(H) = 1/4. 
8.1-10 Derive from Eq. (9) (p. 318) the chain rule 


P(XYZ) = P(X)P(Y|X)P(Z|XY) 


8.1-11* A box contains 10 fair coins with P(H) = 1/2 and 20 loaded coins with P(H) = 1/4. 
A coin is drawn at random from the box and tossed twice. (a) Use Eq. (11) (p. 318) 
to find the probability of the event "all tails." Let the conditioning events be the hon- 
esty of the coin. (b) If the event "all tails" occurs, what's the probability that the coin 
was loaded? 


8.1-12 Do Prob. 8.1~11 for the case when the withdrawn coin is tossed three times. 


8.1-13 Two marbles are randomly withdrawn without replacement from a bag initially 
containing 5 red marbles, 3 white marbles, and 2 green marbles. (a) Use Eq. (11) 
(p. 318) to find the probability that the withdrawn marbles have matching colors. 
Let the conditioning events be the color of the first marble withdrawn. (b) If the 
withdrawn marbles have matching colors, what's the probability that they are white? 


8.1-14 Do Prob. 8.1—13 for the case when three marbles are withdrawn from the bag. 


8.2-1* — Let X = N^, where N is a random integer whose value is equally likely to be any 
integer in the range —1 = N x 3. Plot the CDF of X and use it to evaluate the prob- 
abilities of: X = 0,2 < X x 3, X « 2, and X z 2. 


8.2-2 Do Prob. 8.2-1 with X = 4 cos 7N/3. 
8.2-3 Let py(x) = xe "u(x). Find F,(x) and use it to evaluate P(X = 1), P(1 < X = 2), and 





P(X > 2). 
8.2-4 Let py(x) = $ el. Find F,(x) and use it to evaluate P(X = 0), P(0 < X = 1), and 
P(X > 1). 
8.2-5* Suppose a certain random variable has the CDF 
0 x=0 
F(x) = 4 Kx? 0<x<10 


100K x > 10 


Evaluate K, write the corresponding PDF, and find the values of P(X = 5) and 
P(5«Xzxf!). 


8.2-6 Do Prob. 8.2—5 with 


0 x=0 
F(x) = 4 K sin 7x/40 0<x=10 | 
K sin 7/4 x > 10 | 





8.2-7 


8.2-8t 
8.2-9* 


8.2-10 
8.2-11 


8.2-12 


8.2-13t 
82-14 


8.2-15* 


8.2-16 
8.2-17 
8.2-18 
8.3-1* 


8.3-2 


8.3-3 


8.3-4 
8.3-5* 
8.3-6 


8.3-7 
8.3-8 
8.3-9* 
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Given that Fy(x) = (a + 2 tan^! xy2sr, find the CDF and PDF of the random vari- 
able Z defined by Z = 0 for X = 0 and Z = X for X > 0. 


Do Prob. 8.277 with Z = —1for X = 0 and Z = X for X > 0. 


Let py(x) = 2e7*u(x). Find the PDF of the random variable defined by the transfor- 
mation Z — 2X — 5. Then sketch both PDFs on the same set of axes. 


Do Prob. 8.2-9 with Z = —2X + 1. 


Let X have a uniform PDF over —1 x x x 3. Find and sketch the PDF of Z defined 
by the transformation Z = V X + 1. 


Do Prob. 8.2-11 with Z = |X|. 
Do Prob. 8.211 with Z = VX). 


Consider the square-law transformation Z = X?. Show that 


pol) = a (bx Vz) + pel V2 juo) 


Find p,(y) when pyyCxy) = ye * Pu(x)u(y). Then show that X and Y are not statis- 
tically independent, and find p,(zly). 

Do Prob. 8.2-15 with py,Goy) = [œ + y?/40]1I(72)11(y/6). 

Show that J?a p, (x|y) dx = 1. Explain why this must be true. 

Obtain an expression for p,Cy|x) in terms of py(x|y) and py(y). 


Find the mean, second moment, and standard deviation of X when py(x) = ae~“u(x) 
with a > 0. 


Find the mean, second moment, and standard deviation of X when py(x) = 
a’xe-“u(x) with a > 0. 


Find the mean, second moment, and standard deviation of X when 


V2 


pase rli (x — ay] 


A discrete RV has two possible values, a and b. Find the mean, second moment, and 
standard deviation in terms of p = P(X = a). 


A discrete RV has K equally likely possible values, 0, a, 2a, .. . , (K — 1)a. Find the 
mean, second moment, and standard deviation. 


Find the mean, second moment, and standard deviation of Y = a cos X, where a is a 
constant and X has a uniform PDF over 0 S x x 0 + 2r. 


Do Prob. 8.3—6 for the case when X has a uniform PDF over 0 = x zx 0 4 7. 
Let Y = aX + B. Show that o, = |alox. 
Let Y = X + B. What value of 8 minimizes E(Y?]? 
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8.3-12 


8.3-13 


8.3-14 


8.3-15* 


8.3-16 


8.3-17t 
8.4-1* 


8.4-2 
8.4-3 


8.4-4 
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Let X be a nonnegative continuous RV and let a be any positive constant. By con- 
sidering E[X], derive Markov's inequality P(X = a) = my,/a. 


Use E[(X = Y)*] to obtain upper and lower bounds on E(XY] when X and Y are not 
statistically independent. 


The covariance of X and Y is defined as Cy, = E[(X — mjY(Y — m,)]. Expand this 
joint expectation and simplify it for the case when: (a) X and Y are statistically inde- 
pendent; (b) Y is related to X by Y= aX + f. 


In linear estimation we estimate Y from X by writing Y —oX- B. Obtain expres- 
sions for a and £ to minimize the mean square error e? = E((Y — Y)’]. 


Show that the nth moment of X can be found from its characteristic function via 


i; D" d"®,(v) 
E[X"] =j Br x 





v=0 

Obtain the characteristic function of X when p;(x) = ae~“u(x) with a > 0. Then use 
the relation in Prob. 8.3—14 to find the first three moments. 

Let X have a known PDF and let Y = g(X), so 


Dy(v) = El e?s] = | 28 p (x) dx 


—o00 


If this integral can be rewritten in the form 


®,(v) = 5 e^ h (A) dA 


-—OQ9 


then py(y) = hly). Use this method to obtain the PDF of Y = X? when p,(x) = 
2axe ^" u(x). 

Use the method in Prob. 8.3—16 to obtain the PDF of Y = sin X when X has a uni- 
form PDF over |x| x 77/2. 


Ten honest coins are tossed. What’s the likely range of the number of heads? What’s 
the probability that there will be fewer than three heads? 


Do Prob. 8.4—1 with biased coins having P(H) = 3/5. 


The one-dimensional random walk can be described as follows. An inebriated 
man walking in a narrow hallway takes steps of equal length /. He steps forward 
with probability a = 3 or backwards with probability 1 — a = 1. Let X be his dis- 
tance from the starting point after 100 steps. Find the mean and standard deviation 
of X. 


A noisy transmission channel has per-digit error probability a = 0.01. Calculate the 
probability of more than one error in 10 received digits. Repeat this calculation 
using the Poisson approximation. 
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A radioactive source emits particles at the average rate of 0.5 particles per second. 
Use the Poisson model to find the probability that: (a) exactly one particle is emitted 
in two seconds; (b) more than one particle is emitted in two seconds. 


Show that the Poisson distribution in Eq. (5), (p. 339), yields E(/] = m and E[P] = 
m? + m. The summations can be evaluated by writing the series expansion for e" 
and differentiating it twice. 


Observations of a noise voltage X are found to have a gaussian distribution with m — 
100 and o = 2. Evaluate X? and the probability that X falls outside the range m = c. 


A gaussian RV has X — 2 and X? — 13. Evaluate the probabilities of the events X > 
Sand2<X 5. . 


A gaussian RV has E[X] = 10 and E[X?] = 500. Find P(X > 20), P(10 < X x 20), 
P(0 < X = 20), and P(X > 0). 


When a binomial CDF has n >> 1, it can be approximated by a gaussian CDF with 
the same mean and variance. Suppose an honest coin is tossed 100 times. Use the 
gaussian approximation to find the probability that: (a) heads occurs more than 70 
times; (b) the number of heads is between 40 and 60. 


Let X be a gaussian RV with mean m and variance o°. Write an expression in terms 
of the Q function for P(a < X x b) with a < m < b. 


A random noise voltage X is known to be gaussian with E[X] = 0 and E[X?] = 9. 
Find the value of c such that |X| < c for: (a) 90 percent of the time; (b) 99 percent of 
the time. 


Write e~*/? dà = —(1/A)d(e 7?) to show that the approximation in Eq. (9), 
(p. 341), is an upper bound on Q(X). Then justify the approximation for k > 1. 


Let X be a gaussian RV with mean m and variance o7. (a) Show that E[(X — m)"] = 0 
for odd n. (b) For even n, use integration by parts to obtain the recursion relation 


E((X — m)"] = (n — 1)o* E[(X - m] 
Then show for n = 2,4,6,... that 
E[(X — my'] = 1:3:5-- (n - Do" 


Let X be a gaussian RV with mean m and variance o”. Show that its characteristic 
. . Er i, 
function is P, (v) = e 7" Petm, 


Let Z = X + Y, where X and Y are independent gaussian RVs with different means 
and variances. Use the characteristic function in Prob. 8.4—15 to show that Z has a 
gaussian PDF. Then extrapolate your results for 


] 4 


where the X; are mutually independent gaussian RVs. 
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A random variable Y is said to be log-normal if the transformation X = In Y yields 
a gaussian RV. Use the gaussian characteristic function in Prob. 8.4-15 to obtain 
E[Y] and E[Y?] in terms of m, and o4. Do not find the PDF or characteristic func- 
tion of Y. 


Let Z = X?, where X is a gaussian RV with zero mean and variance o7. (a) Use Eqs. 
(3) and (13), Sect. 8.3, to show that 


(v) = (1 — j2o?v)'? 


(b) Apply the method in Prob. 8.3—16 to find the first three moments of Z. What sta- 
tistical properties of X are obtained from these results? 


The resistance R of a resistor drawn randomly from a large batch is found to have a 
Rayleigh distribution with R? = 32. Write the PDF p,(r) and evaluate the probabil- 
ities of the events R > 6 and 4.5 < R x 5.5. 


The noise voltage X at the output of a rectifier is found to have a Rayleigh distribu- 
tion with X? = 18. Write the PDF p,(x) and evaluate P(X < 3), P(X > 4), and 
P(3«X«z4) 


Certain radio channels suffer from Rayleigh fading such that the received signal 
power is a random variable Z = X? and X has a Rayleigh distribution. Use Eq. (12), 
Sect. 8.2, to obtain the PDF 


pz(z) = = e7simu(r) | 


where m = E[Z]. Evaluate the probability P(Z = km) fork = 1 and k = 0.1. 


Let R, and R, be independent Rayleigh RVs with E[R?] = E[R}] = 207. (a) Use the 
characteristic function from Prob. 8.4-18 to obtain the PDF of A = R}. (b) Now 
apply Eq. (15), Sect. 8.3, to find the PDF of W = Ri + R3. 


Let the bivariate gaussian PDF in Eq. (15), (p. 344), have m, = my = O and ay = oy 
= ø. Show that p,(y) and py(x|y) are gaussian functions. 

Find the PDF of Z = X + 3Y when X and Y are gaussian RVs with my = 6, my = —2, 
Oy = Oy = 4, and E[XY] = —22. 


Let X = Y?, so X and Y are not independent. Nevertheless, show that they are uncor- 
related if the PDF of X has even symmetry. 
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|| meaningful communication signals are unpredictable or random as viewed from the receiving end. Otherwise, 

there would be little value in transmitting a signal whose behavior was completely known beforehand. Furiher- 
more, all communication systems suffer to some degree from the adverse effects of electrical noise. The study of random 
signals and noise undertaken here is therefore essential for evaluating the performance of communication systems. 

Sections 9.1 and 9.2 of this chapter combine concepts of signal analysis ond probability to construct mathe- 
matical models of random electrical processes, notably random signals and noise. Don't be discouraged if the mate- 
rial seems rather theoretical and abstract, for we'll put our models to use in Sects. 9.3 through 9.5. Specifically, Seci. 
9.3 is devoted to the descriptions of noise per se, while Sects. 9.4 and 9.5 examine signal transmission in the pres- 
ence of noise. Most of the topics introduced here will be further developed and extended in later chapters of the text. 


OBJECTIVES 





After studying this chapter and working the exercises, you should be able to do each of the following: 
l. Define the mean and autocorrelation function of a random process, and state the properties of a stationary or 
gaussian process (Sect. 9.1) 
2.  Relate the time and ensemble averages of a random signal from an ergodic process (Sect. 9.1). 


3. Obtain the mean-square value, variance, and power spectrum of a stationary random signal, given its autocorrela- 
tion function (Sect. 9.2). 


4.  Findthe power spectrum of a random signal produced by superposition, modulation, or filtering (Sect. 9.2). 
5. Write the autocorrelation and spectral density of white noise, given the noise temperature (Sect. 9.3). 


6. Calculate the noise bandwidth of a filter, and find the power spectrum and total output power with white noise at 
the input (Sect. 9.3). 


7. State the conditions under which signal-to-noise ratio is meaningful (Sect. 9.4). 
Analyze the performance of an analog baseband transmission system with noise (Sect. 9.4). 
9.  Findthe optimum filter for pulse detection in white noise (Sect. 9.5). 


10. Analyze the performance of a pulse transmission system with noise (Sect. 9.5). 


£9 








9.1 RANDOM PROCESSES 


A random signal is the manifestation of a random electrical process that takes place 
over time. Such processes are also called stochastic processes. When time enters the 
picture, the complete description of a random process becomes quite complicated— 
especially if the statistical properties change with time. But many of the random 
processes encountered in communication systems have the property of stationarity 
or even ergodicity, which leads to rather simple and intuitively meaningful relation- 
ships between statistical properties, time averages, and spectral analysis. 

This section introduces the concepts and description of random process and 
briefly sets forth the conditions implied by stationarity and ergodicity. 
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Ensemble Averages and Correlation Functions 


Previously we said that a random variable maps the outcomes of a chance experiment 
into numbers X(s) along the real line. We now include time variation by saying that 






A Cio p SNL 
We'll represent ensembles formally by v(t,s). When the process in question is elec- 
trical, the sample functions are random signals. 

Consider, for example, the set of voltage waveforms generated by thermal elec- 
tron motion in a large number of identical resistors. The underlying experiment 
might be: Pick a resistor at random and observe the waveform across its terminals. 
Figure 9.1-1 depicts some of the random signals from the ensemble v(t,5) associated 
with this experiment. A particular outcome (or choice of resistor) corresponds to the 
sample function vt) = v(ts) having the value v,(t,) = v(t;,s;) at time t. If you 
know the experimental outcome then, in principle, you know the entire behavior of 
the sample function and all randomness disappears. 

But the basic premise regarding random processes is that you don't know which 
sample function you're observing. So at time t;, you could expect any value from the 
ensemble of possible values v(t;,s). In other words, v(t;,s) constitutes a random vari- 
able, say V;, defined by a “vertical slice" through the ensemble at t = f, as illus- 
trated in Fig. 9.1~1. Likewise, the vertical slice at t; defines another random variable 
V5. Viewed in this light, a random process boils down to a family of RVs. 
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Figure 9.1-1 Waveforms in an ensemble vl, s]. 
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naarn Vect 
/? ' 
Now let's omit s and represent the random process by v(?), just as we did when we 
used X for a single random variable. The context will always make it clear when we're 
talking about random processes rather than nonrandom signals, so we'll not need a 
more formal notation. (Some authors employ boldface letters or use V(t) for the ran- 
dom process and V, for the random variables.) Our streamlined symbol v(t) also agrees 
E: with the fact that we hardly ever know nor care about the details of the underlying 


uem experiment. What we do care about are the statistical properties of v(f). 





i Va For a given random process, the mean value of v(® at arbitrary time t is 
i : defined as 
ger A c op fey frc, TEDA 
ye (UIT Ta v(r) $ E[w()] n 
—«I | g 
p Here, E[v(r)] denotes an ensemble average obtained by averaging over all the sam- 





MC 


> ple functions with time ¢ held fixed at an arbitrary value. Setting t = ż then yields 
; ka z ^ * E[v(t)] = V,, which may differ from Vz. 
pu Efa tal) 





To investigate the relationship between the RVs V, and V, we define the auto- 
L “': correlation function 
YU olio. eit. ^re ca : 


Xu. 
dn (TOW Lo YI 


Ru(ty 15) È E[v(t))v()] (2 
ni i Alone A Y where the lowercase subscript has been used to be consistent with our previous work 
in Chapter 3. This function measures the relatedness or dependence between V, and 
V2. If they happen to be statistically independent, then R,(t;, tj) = V,V;. However, 
if t = t, then V, = V, and R(t, t) = Vi. More generally, setting t£; = 
t, = t yields 





R,(t, f) = E[v^(0)) = v(t) 


which is the mean-square value of v(f) as a function of time. 
Equations (1) and (2) can be written out explicitly when the process in question 
involves an ordinary random variable X in the functional form 


v(t) = g(X, 1) 


Thus, at any time £j, we have the RV transformation V; = g(X,tj). Consequently, 
knowledge of the PDF of X allows you to calculate the ensemble average and the 
autocorrelation function via 
/ 
y 


fu. WD = Eed) = | ae texte) d [3al 


MO p 





"7-09 


oo 
Ry (ti, t2) = E[g(X, t1)9(X, t2)] = | g(x, ti)eGs t2)px(x) dx [35] 
—O00 
Equations (3a) and (35) also generalize to the case of a random process defined in 
terms of two or more RVs. If v(t) = (X,Y, ù, for instance, then Eq. (3b) becomes 
Rtt) = EX, Y )gOG Y h). 
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Occasionally we need to examine the joint statistics of two random processes, 
say u(t) and w(t). As an extension of Eq. (2), their relatedness is measured by the 
crosscorrelation function 


A 
Rh, t5) is E|v(tj)w(t;)] [4] 
The processes are said to be uncorrelated if, for all t; and ¢,, 
Ry (ty, t2) = v(t) X wt) [5] Ey 


Physically independent random processes are usually statistically independent 
and, hence, uncorrelated. However, uncorrelated processes are not necessarily 
independent. 





Consider the random processes u(t) and w(t) defined by EXAMPLE 9.1-1 
vt) =t+X w(t) ^tY 


where X and Y are random variables. Although the PDFs of X and Y are not given, 


we can still obtain expressions for the ensemble averages from the corresponding 
expectation operations. 


The mean and autocorrelation of u(t) are found using Eqs. (3a) and (3d), keep- 
ing in mind that time is not a random quantity. Thus, 


v(t) = Elt + X] =t+ E[X]=t+X 
R(t, t) = El(t, + X)(t5 + X)] 
= Eltyt, + (t, + t)X + X?] 2 tt + (t, +4)X + XX 
Likewise, for w(t), 
w(t) = E[tY]=t¥ — Ry (ty, b) = E[tYtY] = tih Y? 
Taking the crosscorrelation of v(t) with w(t), we get 
Rit, b) = E((t; + X)5Y] = E[t5Y + 5XY] = t6Y + 5XY 
If X and Y happen to be independent, then XY = X x Y and 
Royy(ty to) = tjtSY + XY = (t + X)(65Y) = wt) x w(t) 


so the processes are uncorrelated. 





Randomly Phased Sinusoid EXAMPLE 9.1-2 


Suppose you have an oscillator set at some nonrandom amplitude A and frequency 
€), but you don’t know the phase angle until you turn the oscillator on and observe 
the waveform. This situation can be viewed as an experiment in which you pick an 
oscillator at random from a large collection with the same amplitude and frequency 
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but no phase synchronization. A particular oscillator having phase angle o; gener- 
ates the sinusoidal sample function vt) = A cos (wot + qj), and the ensemble of 
sinusoids constitutes a random process defined by 


u(t) = Acos (wt + ®) we 


where ® is a random angle presumably with a uniform PDF over 27 radians. We’ ll 
find the mean value and autocorrelation function of this randomly phased sinusoid. 

Since v(f) is defined by transformation of ®, we can apply Eqs. (3a) and (35) 
with g(®, t) = A cos (wot + P) and pole) = 1/27 for 0 < ọ = 277. As a preliminary 
step, let n be a nonzero integer and consider the expected value of cos (a + nd). 
Treating « as a constant with respect to the integration over g, 





Ejcos (a  nb)] = | cos (à nb) pe (dois B cos (a3: nd)— de 


a 0 
= [sin (e + 2n) —sina]/207n 0 n*£0 


But, with n = 0, E[cos a] = cos «a because cos a does not involve the random vari- 
able ®. 


Now we find the mean value of v(t) by inserting g(,7) with a = wot into Eq. 
(3a), so 


v(t) = E[g(®, t)] = AE[cos (at + 6)] = 0 


which shows that the mean value equals zero at any time. Next, for the autocorrela- 
tion function, we use Eq. (35) with o, = wot, and a) = wot. Trigonometric expan- 
sion then yields 


Rt, t2) = E[A cos (a, + ®) X A cos (æ, + ®)] 

= (A?/2) E[cos (a; — a2) + cos (a, + a, + 20)] 

= (A?/2){E[cos (a, — o;)] + Elcos (a, + a, + 20)]) 
(A?/2) {cos (a, — a) + 0} 


and hence 
R(t), t2) = (A’/2) cos w(t, — t) 
Finally, setting t; = ft, = t gives 
vr) = R(t, t) = A?/2 


so the mean-square value stays constant over time. 





EXERCISE 9. E 1 Let "E = KI + 3 where Xi is an m RV With x = 0 ad xi- = T Show that TE = 31 
and R,(t,,t2) = 5  9t,t. 
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Ergodic and Stationary Processes 


The randomly phased sinusoid in Example 9.1—2 illustrates the property that some 
ensemble averages may equal the corresponding time averages of an arbitrary sam- 
ple function. To elaborate, recall that if g[u,(f)] is any function of v;(t), then its time 
average is given by 


T/2 
«abi))» - dm | sloda 


-Tf 


With v/(t) = a cos (wot + vj), for instance, time averaging yields «vj(r)^ = 0 = 
E[u(t)] and «vXt)» = ia? = E[v*(t)]. 

Using a time average instead of an ensemble average has strong practical appeal 
when valid, because an ensemble average involves every sample function rather than 
just one. We therefore say that 





This means that we can take time averages of one sample function to determine or at 


AD Sie i 
least estimate ensemble averages. CT l naela l 
. y The definition of ergodicity requires that an ergodic process has <g[u,#)|> = Tempore 





i PÈ (g[v(0]) for any v(t) and any function g[v;(t)]. But the value of «g[v((2] must 
be independent of 7, so we conclude that 





o2 
The randomly phased sinusoid happens tóvergodic, whereas the process in Exercise 
9.1—1 is not since E[v(0)] varies with time. 
When a random signal comes from an ergodic source, the mean and mean- 
square values will be constants. Accordingly, we write manama. 


E ay _ Ma DN 
Ev())-v-m, E] = P = oh t my . l6 


where my and c? denote the mean and variance of v(t). Then, observing that any 
sample function has <v(ġ> = E[v()) and «v?(r)» = E[v?(0n], we can interpret 
certain ensemble averages in more familiar terms as follows: 


l. The mean value my equals the dc component <v (N>. — 





2. The mean squared mj equals the dc power <v,(t)>. 
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3. The mean-square value v? equals the total average power «v». 


The variance c? equals the ac power, meaning the power in the time-varying 
component. 


t 
ROOT Spann (YA. 


5. The standard deviation a equals the rms value of the time-varying component. 


These relations help make an electrical engineer feel more at home in the world of 
random processes. 

Regrettably, testing a given process for ergodicity generally proves to be a 
daunting task because we must show that <g(v,(t)]> = E{g[v(t)]} for any and all 
g[v(t)]. Instead, we introduce a useful but less stringent condition by saying that 





Expressed in mathematical form, wide-sense stationarity requires that 
E|v(t)] = m, . Rt. t) = R(t) — n) i7] 


Any ergodic process satisfies Eq. (7) and thus is wide-sense stationary. However, 
stationarity does not guarantee ergodicity because any sample function of an 
ergodic process must be representative of the entire ensemble. Furthermore, an 
ergodic process is strictly stationary in that all ensemble averages are independent 
of time. Hereafter, unless otherwise indicated, the term stationary will mean wide- 
sense stationary per Eq. (7). 

Although a stationary process is not necessarily ergodic, its autocorrelation 
function is directly analogous to the autocorrelation function of a deterministic sig- 
nal. We emphasize this fact by letting 7 = f — t; and taking either 5j = tor t = tto 
rewrite R(t; — t5) as 


R(t) = Elv(r)w(t - 7)] = Elo + 7)v(2] [8] 
Equation (8) then leads to the properties 
R(-7) = R(7) | [9l 
R,(0) =v? = o2 + m [95] 
IR, (T)| = R,(0) [9c] 


so the autocorrelation function R,(T) of a stationary process has even symmetry 
about a maximum at 7 = 0, which equals the mean-square value. 

For 7 # 0, R(T) measures the statistical similarity of v(t) and u(t = 7). On the 
one hand, if v(t) and v(t + T) become independent as T > co, then 


R,(+00) = 9? = m} (10] 
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On the other hand, if the sample functions are periodic with period To, then 
R(T = nTy) = R(T) n=1,2,... [11] 


and R,(7) does not have a unique limit as |r| — oo. 

Returning to the randomly phased sinusoid, we now see that the stationarity 
conditions in Eq. (7) are satisfied by E[v(ġ] = 0 and R,(t,,t2) = (47/2) cos w(t, — t) 
= R(t — t). We therefore write R,(7) = (A?/2) cos or, which illustrates the prop- 
erties in Eqs. (9a)-(9c). Additionally, each sample function v;( = A cos (wot + c; 
has period T) = 27r/w and so does R,(7), in agreement with Eq. (11). 

Finally, we define the average power of a random process u(t) to be the 
ensemble average of «v?(t)7», so 


P2 g«v(n»] = «Epp» [12] 


This definiüon agrees with our prior observation that the average power of an 
ergodic process is <v;(t)> = v?, since an ergodic process has E[v?(2)] = «v(t)» 
and «E[v?()]2 = E[v?(7n] when E[v?(t)] is independent of time. If the process is 
stationary but not necessarily ergodic, then E[v?(2] = R,(0) and Eq. (12) reduces to 


P = R,(0) [13] 


All stationary processes of practical interest have R,(0) > 0, so most of the sample 
functions are power signals rather than finite-energy signals. 





Random Digital Wave 


The random digital wave comes from an ensemble of rectangular pulse trains like 
the sample function in Fig. 9.1—2a. All pulses have fixed nonrandom duration D, but 
the ensemble involves two random variables, as follows: 


1. The delay T, is a continuous RV uniformly distributed over 0 < ¢, = D, indi- 
cating that the ensemble consists of unsynchronized waveforms. 


2. The amplitude a, of the kth pulse is a discrete RV with mean E[a,] = O and vari- 
ance c?, and the amplitudes in different intervals are statistically independent 
so E[aja,] = E[aj|E[a,] = 0 for j # k. 


Note that we're using the lowercase symbol a; here for the random amplitude, and that 
the subscript k denotes the sequence position rather than the amplitude value. We'll 
investigate the stationarity of this process, and we'll find its autocorrelation function. 

Consider the kth pulse interval defined by kD + T, < t < (k + 1)D + T, and 
shown in Fig. 9.1-2b. Since v(t) = a, when t; falls in this interval, and since all such 
intervals have the same statistics, we conclude that 


Ew] Eja] -0 —E(v()- Ea] = o? 


EXAMPLE 9.1-3 
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Figure 9.1-2 Random digital wave. (a) Sample function; (b] kth pulse interval. 


Being independent of time, these results suggest a stationary process. To complete 
the test for wide-sense stationarity, we must find R,(t,,£5). However, since the proba- 
bility function for the pulse amplitudes is not known, our approach will be based on 
the expectation interpretation of the ensemble average E[v(t;)v(1;)] when t, and t; 
fall in the same or different pulse intervals. 

Clearly, t; and £; must be in different intervals when |t; — t| > D, in which case 
v(t) = a; and v(t5) = a, with j kso 


Elv(t))u(t.)] = Elaja] = 0 Ih -—1| >D 
But if |t; — t| < D, then either t, and t; are in adjacent intervals and E[v(t,)v(t,)] = 0, 
or else t; and t; are in the same interval and E[v(t,)u(t,)] = Elaz] = o?. 
We therefore let A stand for the event “t, and ż in adjacent intervals” and write 
E[v(t)u(t;)] = Elaja,|P(A) + E[az]|1 — P(A)] 
—-gt1-P(4)] |5-t1| «D 


From Fig. 9.1-2b, the probability P(A) involves the random delay T, as well as t, 
and t,. For f, < t; as shown, f, and t; are in adjacent intervals if t, < kD + T; < h, and 
t= kD 
P - kD «T, «  - $5) = | — di = 
t,—kD 


t — ty 
D 





Including the other case when ft, < t;, the probability of 1; and t; being in adjacent 
intervals is 


P(A) = |t — t,|/D 
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Ry(T) 
3 
mp 9 D 
Figure 9.1-3 Autocorrelation of the random digital wave. 


and hence, 
E[v(t)v(5)] = &'(1 — |t — tl/D] |; — t| <D 
Combining this result with our previous result for |t; — t,| > D, we have 


ht 
R(tit2) = zi e El |I5- t| «D 


Since R,,(t,,f2) depends only on the time difference t, — t, the random digital wave 
is wide-sense stationary. 


Accordingly, we now let r = f, — t; and express the correlation function in the 
compact form 


Rr) = e?^A(1/D) 


where A(7/D) is the triangle function. The corresponding plot of R,(7) in Fig. 9.1-3 
deserves careful study because it further illustrates the autocorrelation properties 
stated in Eqs. (9a)-(9c), with my = 0 and v? = o? + m? = o°. 

The average power of this process is then given by Eq. (13) as 


P=R,(0) = o? 


However, the process is not ergodic and the average power of a particular sample 
function could differ from P. By way of example, if v,(t) happens to have a, = dp for 
all k, then <vu?()> = <ai> = a} * P. We use P as the “best” prediction for the 
value of <vu?(t)> because we don't know the behavior of u,(t) in advance. 











362 CHAPTER? € Random Signals and Noise 


Gaussian Processes 


A random process is called a gaussian process if all marginal, joint, and conditional 
PDFs for the random variables V; = v(tj) are gaussian functions. But instead of find- 
ing all these PDFs, we usually invoke the central-limit to determine if a given 
process is gaussian. Gaussian processes play a major role in the study of communi- 
cation systems because the gaussian model applies to so many random electrical 
phenomena—at least as a first approximation. 

Having determined or assumed that v(t) is gaussian, several important and 
convenient properties flow therefrom. Specifically, more advanced investigations 
show that: 


1. The process is completely described by E[u()] and R,(t,,t;). 


2. If R(t) = E[v(t))]E|v(t;)], then v(t,) and v(t) are uncorrelated and statisti- 
cally independent. 


UJ 


If u(t) satisfies the conditions for wide-sense stationarity, then the process is 
also strictly stationary and ergodic. 


4. Any linear operation on u(t) produces another gaussian process. 
These properties greatly simplify the analysis of random signals, and they will be 


drawn upon frequently hereafter. Keep in mind, however, that they hold in general 
only for gaussian processes. 





By considering R,(1,,/)), determine the properties of w(t) = 2u(¢) — 8 when v(7) is a 
gaussian process with E[u(r)] = 0 and R,(t,,t;) = 9675m, 


EXERCISE 9.1-3 





9.2 RANDOM SIGNALS 


This section focuses on random signals from ergodic or at least stationary sources. 
We'll apply the Wiener-Kinchine theorem to obtain the power spectrum, and we'll 
use correlation and spectral analysis to investigate filtering and other operations on 
random signals. 


Power Spectrum 


When a random signal v(t) is stationary, then we can meaningfully speak of its 
power spectrum G,(f) as the distribution of the average power P over the frequency 
domain. According to the Wiener-Kinchine theorem, G, (f) is related to the auto- 
correlation R,(T) by the Fourier transform 


GP) = FIR] Ê [ Rea ila 


7—00 





ium 
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Conversely, 


Rin) = Fans | atra (1B 


Thus, the autocorrelation and spectral density constitute a Fourier transform pair, 
just as in the case of deterministic power signals. Properties of G,(f) include 


iE G(f) df = R(0) = v? =P [2] 
G(f)2=0 GOP) =G) [3] 


The even-symmetry property comes from the fact that R (T) is real and even, since 
v(t) is real. 

The power spectrum of a random process may be continuous, impulsive, or 
mixed, depending upon the nature of the source. By way of illustration, the ran- 
domly phased sinusoid back in Example 9.1—2 has 

A? A? A? 
R(t) =F cos2mfr > Gf) =F (Ff) +> BA) A 
The resulting impulsive spectrum, plotted in Fig. 9.2-1a, is identical to that of a 
deterministic sinusoid because the randomly phased sinusoid comes from an 
ergodic process whose sinusoidal sample functions differ only in phase angle. As 
contrast, the random digital wave in Example 9.1-3 has 


R(T) = o?N(7/D) €» G(f) = a?D sinc? fD [5] 


Figure 9.2—1b shows this continuous power spectrum. 

Since the autocorrelation of a random signal has the same mathematical proper- 
ties as those of a deterministic power signal, justification of the Wiener-Kinchine 
theorem for random signals could rest on our prior proof for deterministic signals. 
However, an independent derivation based on physical reasoning provides addi- 
tional insight and a useful alternative definition of power spectrum. 








A A P 
4 E 
f f 
=J 0 Jo ee 
D D 
(a) (b) 
Figure 9.2-1 Power spectra. {a} Randomly phased sinusoid; (b) random digital wave. 
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Consider the finite-duration or truncated random signal 


a wt) lj < T/2 
uu n ld» T/2 


Since each truncated sample function has finite energy, we can introduce the Fourier 
transform 


co T/2 
Vf s) = | v, (t)e ^" dt = | v(t)e " dt [6] 
—-o00 -T/A 
Then |Vi(f,s)l? is the energy spectral density of the truncated sample function 
Ur (t,5;). Furthermore, drawing upon Rayleigh's energy theorem in the form 


m v^(t) dt = [ uz(t) dt = [ Veh, s)? df 


-TIh —oo — 


the average power of v(t) becomes 


T—o0o 


T/2 
P = lim | E[v*(1) | dt 
-T 


co 


qa 1 " 2 = " 1 2 
= ime >| IVzCf, s)| 4f = üm z E[IV«(f, s)?] af 
Accordingly, we now define the power spectrum of v(t) as 


GAP) Ê dim = ELIE 97] m 


which agrees with the properties in Eqs.(2) and (3). 

Conceptually, Eq. (7) corresponds to the following steps: (1) calculate the 
energy spectral density of the truncated sample functions, (2) average over the 
ensemble, (3) divide by T to obtain power, and (4) take the limit T — ~. Equation 
(7) provides the basis for experimental spectra estimation. For if you observe a sam- 
ple function v(t,5;) for a long time T, then you can estimate G,( f) from 


GA) = z I Gs 


The spectral estimate G,( f ) is called the periodogram because it originated in the 
search for periodicities in seismic records and similar experimental data. 

Now, to complete our derivation of the Wiener-Kinchine theorem, we outline 
the proof that G,(f) in Eq. (7) equals F (R,(r)]. First, we substitute Eq. (6) into 
EVAN] = E(Vz CA S)V;Cf.5)] and interchange integration and expectation to get 


ev. P] = |] Bete o(ay ie" ae an 


-T2 


T/ 





i 
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Figure 9.2-2 Integration region in the 7 — u plane. 


in which 
E[v()v(4)] = R(t A) = R(t — A) 
Next, let 7 = t — A and u = t so the double integration is performed over the region 


of the 7 — u plane shown in Fig. 9.2-2. Integrating with respect to u for the two 
cases T < O and 7 > Q then yields 


E| ivf, 8] = [ «e ( | m du Jar £ [Rate | d du je 


-7/2 -T 


| l Rye (T + r)dr + [ Roe (T - 7)dr 


-T 
Finally, since r = —|r| for r < 0, we have 
2l - j a M 
E||V«Cf. s)| ] ST Does T Rive" dr [8] 
-T 
"Therefore, 


nim 5. rw s)P] = [ nece 


—Cco 


which confirms that G,(f) = ¥,[R,(7)]. 





Random Telegraph Wave 
Figure 9.2—3a represents a sample function of a random telegraph wave. This signal 


makes independent random shifts between two equally likely values, A and 0. The 


number of shifts per unit time is governed by a Poisson distribution, with u being 
the average shift rate. 


EXAMPLE 9.2-1 
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u(t) 
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Random telegraph wave. (c) Sample function; (b) autocorrelation; (c) power spectrum. 


We'll find the power spectrum given the autocorrelation function 
A 
R(T) = 4€ sue eq 


which is sketched in Fig. 9.2—35. From R,(T) we see that 


€ A 3 A? 
P=P RO=% m= R0) =4 


so the rms value is oy = V v? — mj, = A/2. 


Taking the Fourier transform of R,(7) gives the power spectrum 
A? A 
OUS m Uer ou) 
aul + (xf/nY] 


padankan fia aah 
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which includes an impulse at the origin representing the dc power mg = A?/4. This 
mixed spectrum is plotted in Fig. 9.2-3c. Although u equals the average shift rate, 
about 20 percent of the ac power (measured in terms of 0) is contained in the 
higher frequencies |f] > p. 





To confirm in general that G,(f) includes an impulse when my # 0, let z(f) = v(t) ^ my 
and show from R,(7) that G,(f) = GAP) + m?&(f). 


EXERCISE 9.2-1 





Superposition and Modulation 


Some random signals may be viewed as a combination of other random signals. In 
particular, let u(t) and w(t) be jointly stationary so that 


Rosy t2) m Ry,(ti ps t;) 


and let 
at) = v(t) = w(t) [9] 
Then 
R(t) = R(t) + R(t) = [R(v) + R(t) 
and 


I+ 


G(f) = GF) + GC) = [GG + Golf] 
where we have introduced the cross-spectral density 
GU) È &[R()] 110) 


The cross-spectral density vanishes when u(t) and w(?) are uncorrelated and mymy = 0, 
so 


R, (7) = Ry(t) =0 [11a] 
Under this condition . 
RLT) = R,(T) + R, (7) [115] 
G(f) = Gf) + GC) (idl 
Z=v tw? Ur 


Thus, we have superposition of autocorrelation, power spectra, and average power. 

When Eq. (11a) holds, the random signals are said to be incoherent. Signals 
from independent sources are usually incoherent, and superposition of average 
power is a common physical phenomenon. For example, if two musicians play in 
unison but without perfect synchronization, then the total acoustical power simply 
equals the sum of the individual powers. 
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Furthermore, the mean value of the output is 


my = | MAJAN | = H(0)my [20] 
where H(0) equals the system's dc gain. 

The power spectrum relation in Eq. (18) has additional value for the study of 
linear operations on xandom signals, whether or not filtering is actually involved. In 
particular, suppose that you know G,(f) and you want to find G,(f) when y(t) = 
dx(t)/dt. Conceptually, y(t) could be obtained by applying x(t) to an ideal differentia- 
tor which we know has H(f) = j2af. We thus see from Eq. (18) that if 


y(t) = dx(t)/dt [21a] 
then 
G(f) = QafY Gf) [215] 
Conversely, if 
t 
ab | x(A)dA — mp = 0 [22] 
then 
G(f) = Qaf) ^ &Cf) [22b] 
These relations parallel the differentiation and integration theorems for Fourier 
transforms. 





EXAMPLE 9.2-2 Let the random telegraph wave from Example 9.2-1 be applied to an ideal bandpass 
filter with unit gain and narrow bandwidth B centered at f, = p/m. Figure 9.2-5 
shows the resulting output power spectrum G,(f) = IHCO PG P). 

With G+) = A?/8u and B < f., we have 


fA Bu -B/2<|f- fi < B/2 
Gre is otherwise 


and the output power equals the area under G,(f), namely 
y! = 2B x (A?/8u) = A?B/Ay « A? /An 


whereas x? — A?/2. Moreover, since H(0) = 0, we know from Eq. (20) that my = 0 
even though my = A/2. Note that we obtained these results without the added labor of 
finding R,(7). 


npe nma nnm nae 
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Figure 9.2-5 Filtered power spectrum in Example 9.2-2. 





Hilbert Transform of a Random Signal EXAMPLE 9.2-3 
The Hilbert transform x(t) was previously defined as the output of a quadrature 
phase-shifting filter having 


1 
holt) - — Bf) = ~jsenf 


Since |Hg(f)|* = 1, we now conclude that if x(#) is a random signal and y(t) = 
X(t), then 
GAF) = GO) — R(t) = 87 [G)] = Ré) 


Thus, Hilbert transformation does not alter the values of my or x, 
However, from Eq. (17a), 


Rer) = ho(r)*R,(r) = Rr) 
where R(T) stand for the Hilbert transform of R,(r). It can also be shown that 
Realt) = - R7) 
We'll apply these results in the next chapter. 





Let the random digital wave described by Eq. (5) (p. 363) be applied to a first-order 
LPF with H(f) = [1 + j(f/B)]! and B < 1/D. Obtain approximate expressions for 
G,(f) and R,(7). 


EXERCISE 9.2-3 








9.3 NOISE 


Unwanted electric signals come from a variety of sources, generally classified 
as either human interference or naturally occurring noise. Human interference is 
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produced by other communication systems, ignition and commutator sparking, ac 
hum, and so forth. Natural noise-generating phenomena include atmospheric distur- 
bances, extraterrestrial radiation, and random electron motion. By careful engineer- 
ing, the effects of many unwanted signals can be reduced or virtually eliminated. 
But there always remains certain inescapable random signals that present a funda- 
mental limit to system performance. 

One unavoidable cause of electrical noise is the thermal motion of electrons in 
conducting media—wires, resistors, and so on. Accordingly, this section begins with 
a brief discussion of thermal noise that, in turn, leads to the convenient abstraction 
of white noise—a useful model in communication. We then consider filtered white 
noise and input-output relations. Other aspects of noise analysis are developed in 


subsequent chapters and the appendix. In particular, the special properties of band- 
pass noise will be discussed in Sect. 10.1. 


Thermal Noise and Available Power 
For our purposes, 





From kinetic theory, the average energy of a particle at absolute temperature 9 is 
proportional to KJ , k being the Boltzmann constant. We thus expect thermal-noise 
values to involve the product kJ. In fact, we'll develop a measure of noise power in 
terms of temperature. Historically, Johnson (1928) and Nyquist (1928b) first studied 
noise in metallic resistors—hence, the designation Johnson noise or resistance 
noise. There now exists an extensive body of theoretical and experimental studies 
pertaining to noise, from which we'll freely draw. 

When a metallic resistance R is at temperature J, random electron motion pro- 
duces a noise voltage u(t) at the open terminals. Consistent with the central-limit 
theorem, u(t) has a gaussian distribution with zero mean and variance 

sj aL 2(qk9 y " 
U Ty = 3h R V [11 


where J is measured in kelvins (K) and 


k = Boltzmann constant = 1.38 x 107” J/K 


h = Planck constant = 6.62 X 10 ?^ J.s 


The presence of the Planck constant in Eq. (1) indicates a result from quantum theory. 
The theory further shows that the mean square spectral density of thermal noise is 
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2Rh| f| 
ef _ ] 


Gf) = V?/Hz [2a] 


which is plotted in Fig. 9.3-1 for f = 0. This expression reduces at “low” frequen- 
cies to 


Alf 
T 


kT 
f| « — [2bj 
h 
Both Eq. (2a) and (2b) omit a term corresponding to the zero-point energy, which is 
independent of temperature and plays no part in thermal noise transferred to a load. 
Fortunately, communication engineers almost never have need for Eqs. (2a) and 
(2b). To see why, let room temperature or the standard temperature be 


Gf) 2a (1 - 


Ty Ê 290 K (63°F) [3a] 
which is rather on the chilly side but simplifies numerical work since 
kJa =~ 4 X 107°?! W-s [35] 


If the resistance in question is at J, then G,(f) is essentially constant for |f| < 
O.1k9p/h ~ 10!? Hz. But this upper limit falls in the infrared part of the electromag- 
netic spectrum, far above the point where conventional electrical components have 
ceased to respond. And this conclusion holds even at cryogenic temperatures (J = 
0.0013). 

Therefore, for almost all purposes we can say that the mean square voltage 
spectral density of thermal noise is constant at 


G(f) =2RkT  V2/Hz [4] 


obtained from Eq. (25) with Al fV/kT <1. The one trouble with Eq. (4) is that it erro- 
neously predicts v? = oo when G,(f) is integrated over all f. However, you seldom 
have to deal directly with v? because v(t) is always subject to the filtering effects of 
other circuitry. That topic will be examined shortly. Meanwhile, we’ll use Eq. (4) to 
construct the Thévenin equivalent model of a resistance, as shown in Fig. 9.3—2a. 
Here the resistance is replaced by a noiseless resistance of the same value, and the 


Gf) 





Figure 9.3-1 Thermal noise spectra density, V?/Hz. 
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Gf) = 
"2K9 /R 





Figure 9.3-2 Thermal resistance noise. (a) Thévenin equivalent; (b] Norton equivalent. 


noise is represented by a mean square voltage generator. Similarly, Fig. 9.3-2b is the 
Norton equivalent with a mean square current generator having G,(f) = G,(£/R? = 
2k9 /R. Both generators are shaded to indicate their special nature. 

Instead of dealing with mean square voltage or current, describing thermal 
noise by its available power cleans up the notation and speeds calculations. Recall 
that available power is the maximum power that can be delivered to a load from a 
source having fixed nonzero source resistance. The familiar theorem for maximum 
power transfer states that this power is delivered only when the load impedance is 
the complex conjugate of the source impedance. The load is then said to be matched 
to the source, a condition usually desired in communication systems. 

Let the sinusoidal source in Fig. 9.3-3a have impedance Z, = R, + jX, and let 
the open-circuit voltage be v,. If the load impedance is matched, so that Z; = Zt = 
R, — jX, then the terminal voltage is v,/2 and the available power is 


_ <u Q/2P> «uo 
cee ae | 


7 S 
i 


a 


Using our Thévenin model, we extend this concept to a thermal resistor viewed as a 
noise source in Fig. 9.3-3b. By comparison, the available spectral density at the 
load resistance is 


G, 
W lig W/Hz [5] 


A thermal resistor therefore delivers a maximum power density of kJ/2 W/Hz to a 
matched load, regardless of the value of R! 


a " 
2-zé G&(f)= 
Zy =Z v 
«6 em UDE r 
la) (b) 
Figure 9.3-3 Available power. {a} AC source with matched lead; (b) thermal resistance with 


matched load. 
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Calculate from Eq. (1) the rms noise voltage across the terminals of a 1 — kQ resis- EXERCISE 9.3-1 


age that comes from frequency components in the range |f| < 1 GHz. 





White Noise and Filtered Noise 


Besides thermal resistors, many other types of noise sources are gaussian and have a 
flat spectral density over a wide frequency range. Such a spectrum has all frequency 
: components in equal proportion, and is therefore called white noise by analogy to 
i white light. White noise 1s a convenient model (and often an accurate one) in com- 
3 munications, and the assumption of a gaussian process allows us to invoke all the 
aforementioned properties—but some applications (beyond our scope) may need a 
more advanced model for the noise. 
We'll write the spectral density of white noise in general as 


G(f) = No/2 [6a] 


where Np represents a density constant in the standard notation. The seemingly 
extraneous factor of 1/2 is included to indicate that half the power is associated with 
positive frequency and half with negative frequency, as shown in Fig. 9.3-4a. Alter- 
natively, No stands for the one-sided spectral density. The autocorrelation function 
for white noise follows immediately by Fourier transformation of G( f), so 


. tance at J = 29 K. Then use Eq. (2D) to find the percentage of the mean square volt- 
Í 





R(r) = oalr) [6b] 


as in Fig. 9.3-4b. 

We thus see that R(T) = 0 for 7 # 0, so any two different samples of a gaussian 
white noise signal are uncorrelated and hence statistically independent. 'This obser- 
vation, coupled with the constant power spectrum, leads to an interesting conclu- 
sion, to wit: When white noise is displayed on an oscilloscope, successive sweeps 
are always different from each other; but the waveform always looks the same, no 


G(f) R(t) 
No 
E 
No 
2 i 
0 f 0 
(a) (b) 


Figure 9.3-4 White noise. (a) Power spectrum; (b) autocorrelation. 
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matter what sweep speed is used, since all rates of time variation (frequency compo- 
nents) are contained in equal proportion. Sumilarly, when white noise drives a loud- 
speaker, it always sounds the same, somewhat like a waterfall. 

The value of No in Eqs. (6a) and (6b) depends upon two factors: the type of 
noise, and tbe type of spectral density. If the source is a thermal resistor, then the 
mean square voltage and mean square current densities are 


Nop = 4RKT Ny = 4KT/R [7] 


where the added subscripts v and i identify the type of spectral density. Moreover, 
any thermal noise source by definition has the available one-sided noise spectral 
density 2G,(f) = kT. Other white noise sources are nonthermal in the sense that the 
available power is unrelated to a physical temperature. Nonetheless, we can speak of 
the noise temperature Jy of almost any white noise source, thermal or nonthermal, 
by defining 





2G(f) N, 
y A Ua 0 
gj rm = — 8 
N k x [8al 
Then, given a source’s noise temperature, 
No = ky [8b] 


It must be emphasized that Jy is not necessarily a physical temperature. For 
instance, some noise generators have Jy ~ 109, = 3000 K (5000? F), but the 
devices surely aren’t that hot. 

Now consider gaussian white noise x(f) with spectral density Gf) = Ny2 
applied to an LTT filter having transfer function H(f). The resulting output y(t) will 
be gaussian noise described by 


SU) - TOP 9 
Re) = 229 [Ae] o 
yal mora isa 


Pay careful attention to Eq. (9a) which shows that the spectral density of filtered white 
noise takes the shape of |H(f)P. We therefore say that filtering white noise produces 
colored noise with frequency content primarily in the range passed by the filter. 

As an illustration of filtered noise, let H(f) be an ideal lowpass function with 
unit gain and bandwidth B. Then 


CU re Sn (+) R,(7) = NoB sinc 2Br [10a] 
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Gf) Ry(T) 
| 
NoB 
No 
2 
" 
-B 0 B f 1 0 P 
2B 2B 
la] (b) 
Figure 9.3-5 White noise passed by an ideal LPF. (a) Power spectrum; [b] autocorrelation. 


which are plotted in Fig. 9.3-5. Besides the spectral shaping, we see that lowpass 


filtering causes the output noise to be correlated over time intervals of about 1/2B. 
We also see that 


y? = NB [105] 


so the output power is directly proportional to the filter's bandwidth. 





Thermal Noise in an RC Network EXAMPLE 9.3-1 


To pull together several of the topics so far, consider the RC network in Fig. 9.3-6a 
with the resistor at temperature J. Replacing this thermal resistor with its Thévenin 
model leads to Fig. 9.3-6b, a white noise mean square voltage source with G, (f) = 
2RkT V?/Hz applied to a noiseless RC LPF. Since |H(f)|* = [1 + (//B)*1"}, the out- 
put spectral density is 


2Rk9 1 
G, = |H(f)|?G So B=, 11 
The inverse transform then yields 
—2nB\r| kI -|r|/RC 
R(T) = 2RkI «Be TAn = e [115] 
C 





(a) (b) 


Figure 9.3-6 la] RC network with resistance noise; (b) noise equivalent model. 
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which shows that the interval over which the filtered noise voltage has appreciable 
correlation approximately equals the network’s time constant RC, as might have 
been suspected. 

We can further say that y(t) is a gaussian random signal with no dc component— 
since x(f) is a zero-mean gaussian—and 


-— kT 
y? = R,(0) = pz [12] 


Surprisingly, y? depends on C but not on R, even though the noise source is the ther- 
mal resistor! This paradox will be explained shortly; here we conclude our example 
with a numerical calculation. 

Suppose the resistor is at room temperature Jy and C = 0.1 uF; then 


y -4x10"/107"24x107* v? 


and the rms output voltage is cy = 2 X 1077 = 0.2 u V. Such exceedingly small val- 
ues are characteristic of thermal noise, which is why thermal noise goes unnoticed 
in ordinary situations. However, the received signal in a long-distance communica- 
tion system may be of the same order of magnitude or even smaller. 





a 
^ 


Noise Equivalent Bandwidth 


Filtered white noise usually has finite power. To emphasize this property, we desig- 
nate average noise power by N and write Eq. (9c) in the form 


No E 2 m T 2 
wet \H(f)| a= vo H(A)? df 


Noting that the integral depends only on the filter’s transfer function, we can sim- 
plify discussion of noise power by defining a noise equivalent bandwidth B, (or 
just the noise bandwidth) as 


al 
N' 


B if |\H( f)|* af [13] 
8 Js 
where 


g = |H) ix 


which stands for the center-frequency power ratio (assuming that the filter has a 
meaningful center frequency). Hence the filtered noise power is 


N = gNoBy [14] 
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This expression becomes more meaningful if you remember that N, represents 
density. 

Examining Eq. (14) shows that the effect of the filter has been separated into 
two parts: the relative frequency selectivity as described by By, and the power gain 
(or attenuation) represented by g. Thus, By equals the bandwidth of an ideal rectan- 
gular filter with the same gain that would pass as much white noise power as the fil- 
ter in question, as illustrated in Fig. 9.3-7 for a bandpass filter. 

Let's apply Eqs. (13) and (14) to the RC LPF in Example 9.3-1. The filter’s 
center frequency is f = 0 so g = |H(0)? = 1, and 


By= | af as ae [15] 
| 1-(ffBR 2" 4RC 





The reason why y? in Eq. (12) is independent of R now becomes apparent if we 
write y? = N = N,By = (ARk9) X (1/ARC). Thus, increasing R increases the 
noise density Nọ (as it should) but decreases the noise bandwidth By. These two 
effects precisely cancel each other and y? = K9'/C. 

By definition, the noise bandwidth of an ideal filter is its actual bandwidth. For 
practical filters, By is somewhat greater than the 3-dB bandwidth. However, as the 
filter becomes more selective (sharper cutoff characteristics), its noise bandwidth 
approaches the 3-dB bandwidth, and for most applications we are not too far off in 
taking them to be equal. 

In summary, if y(t) is filtered white noise of zero mean, then 


y? = g} = N = SN BS Oy = VN - V gNSBy 116] 


This means that given a source of white noise, an average power meter (or mean 
square voltage meter) will read y? = N = N)By, where By is the noise bandwidth of 
the meter itself. Working backward, the source power density can be inferred via 
No = N/By, provided you’re sure that the noise is white over the frequency-response 
range of the meter. 


Figure 9.3-7 Noise equivalent bandwidth of a bandpass filter. 
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Consider an nth-order Butterworth LPF defined by Eq. (4), p. 113. Show that the 
noise bandwidth By is related to the 3-dB bandwidth B by 
7B 


By = ——— —— 
N 2nsin(r/2n) 





EXERCISE 9.3-2 


[17] 


Hence, By > B as n > %. 





System Measurements Using White Noise *« 


Since white noise contains all frequencies in equal proportion, it's a convenient signal 
for system measurements and experimental design work. Consequently, white noise 
sources with calibrated power density have become standard laboratory instruments. 
A few of the measurements that can be made with these sources are outlined here. 


Noise Equivalent Bandwidth Suppose the gain of an amplifier is known, and we 
wish to find its noise equivalent bandwidth. To do so, we can apply white noise to 
the input and measure the average output power with a meter whose frequency 
response is essentially constant over the amplifier's passband. The noise bandwidth 
in question is then, from Eq. (14), By = N/gNp. 


Amplitude Response — To find the amplitude response (or amplitude ratio) of a given 
system, we apply white noise to the input so the output power spectrum is propor- 
tional to |H(f)P. Then we scan the output with a tunable bandpass filter whose band- 
width is constant and small compared to the variations of |H(f)|?. Figure 9.3-8a dia- 
grams the experimental setup. If the scanning filter is centered at f., the rms noise 
voltage at its output is proportional to |H(f.)|. By varying f., a point-by-point plot of 
|H(f)| is obtained. 


Impulse Response Figure 9.3-8b shows a method for measuring the impulse 
response A(t) of a given system. The instrumentation required is a white noise 
source, a variable time delay, a multiplier, and an averaging device. Denoting the 
input noise as x(t), the system output is A'x(r), and the delayed signal is x(t — ty). 
Thus, the output of the multiplier is 





oo 
z(t) = x(t — tj)|h*x(t)] = x(t — 21 h(A)x(t — A)dA 
—oo 
Now suppose that z(t) is averaged over a long enough time to obtain <z(t)>. If the 
noise source 1s ergodic and the system is linear and time-invariant, then the average 
output approximates the ensemble average 


oo 


E(z(t)] = f A(A)E[x(t — tj)x(t — A)]da = | A(A)R,(A — tj)dÀ 


—co —oo 


But with x(t) being white noise, Eq. (6b) says that RAA — tj) = (Ny2)6(A — ta). 
Hence, 
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Figure 9.3-8 System measurements using white noise. (a) Amplitude response; (b) impulse 
response. 


«z(t)- an h(A)8(A — ty)dA = - h(t4) 


and A(t) can be measured by varying the time delay fj. 

The measurement techniques in Fig. 9.3—8 have special value for industrial pro- 
cessing or control systems. A conventional sinusoidal or pulsed input could drive 
such a system out of its linear operating region and, possibly, cause damage to the 


system. Low-level white noise then provides an attractive alternative for the test 
input signal. 





9.4 BASEBAND SIGNAL TRANSMISSION WITH NOISE 


At last we're ready to investigate the effects of noise on electrical communication. 
We begin here by studying baseband signal transmission systems with additive noise, 
and we'll introduce the signal-to-noise ratio as a measure of system performance in 
regard to analog communication. Section 9.5 focuses on pulse transmission. 

Throughout this section and Section 9.5, we'll restrict our consideration to a 
linear system that does not include carrier modulation. This elementary type of sig- 
nal transmission will be called baseband communication. The results obtained for 
baseband systems serve as a benchmark for comparison when we discuss carrier 
modulation systems with noise in subsequent chapters. 
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Additive Noise and Signal-to-Noise Ratios 


Contaminating noise in signal transmission usually has an additive effect in the 
sense that 





For purposes of analysis, all the noise will be lumped into one source added to the 


signal x,(t) at the input of the receiver. Figure 9.4-1 diagrams our model of additive 
noise. 


This model emphasizes the fact that the most vulnerable point in a transmission 
system is the receiver input where the signal level has its weakest value. Further- 
more, noise sources at other points can be "referred" to the receiver input using tech- 
niques covered in the appendix. 


Since the receiver is linear, its combined input produces output signal plus 
noise at the destination. Accordingly, we write the output waveform as 


Xp(t) = xp(t) + no(t) [t] 


where xp(t) and np(t) stand for the signal and noise waveforms at the destination. 
The total output power is then found by averaging y3(t) = xg(t) + 2xp(f)np(t) + 
ng(t). 
To calculate this average, we'll treat the signal as a sample function of an 
ergodic process and we'll make two reasonable assumptions about additive noise: 


1. The noise comes from an ergodic source with zero mean and power spectral 
density G,(f); 


2. The noise is physically independent of the signal and therefore uncorrelated 
with it. 


Under these conditions the statistical average of the crossproduct xp(t)np(f) equals 
zero because xp(t) and np(t) are incoherent. Thus, the statistical average of yg(t) yields 


yp = xp * nb [2] 
which states that we have superposition of signal and noise power at the destination. 


Additive noise 


Received GAC f) Destination 


signal 
R , . yp) = 
xp(t) ( ) | Linear xp(t) + np) 


receiver 


Figure 9.4-1 Model of received signal with additive noise. 








9.4 Baseband Signal Transmission with Noise 


Let's underscore the distinction between desired signal power and unwanted 
noise power by introducing the notation 


A 2 
Sp ? xp Np 


lip 
Sel 
oY 


[3a] 
so that 


y» = Sp + Np [3b] 


The signal-to-noise ratio will now be defined as the ratio of signal power to noise 
power, symbolized by 


(S/N )p = Sp/Np = x5/ n5 [4] 
and often expressed in decibels. 

This ratio provides an important and handy indication of the degree to which 
the signal has been contaminated with additive noise. But note that the interpretation 
of signal-to-noise ratio is meaningful only when Eq. (2) holds. Otherwise, y2 would 
include additional terms involving the crossproduct of signal times noise. 

Superposition of signal and noise power is a helpful condition in experimen- 
tal work because you can't turn off the noise to determine Sp alone. Instead, you 
must measure Np alone (with the signal off) and measure y2, = Sp + Np (with the 
signal on). Given these measured values, you can calculate (S/N)p from the relation- 
ship y5/Np = (Sp + Np)/Np = (S/N)p + 1. 

For analytic work, we generally take the case of white noise with G,(f) = Ny2. 


If the receiver has gain gp and noise bandwidth By, the destination noise power 
becomes 


Np = ga NoBy [5] 


When the noise has a gaussian distribution, this case is called additive white gauss- 
ian noise (AWGN), which is often the assumed model. 

In any white-noise case, the noise density may also be expressed in terms of the 
noise temperature F y referred to the receiver input, so that 


No = kTy = k9o(Su/99) ~ 4 X 10 *(9y/9y) W/Hz [6] 


where we've introduced the standard temperature J, for numerical convenience. 
Typical values of J, range from about 0.27, (60 K) for a carefully designed low- 
noise system up to 109 , or more for a “noisy” system. 


Analog Signal Transmission 


Figure 9.4—2 represents a simple analog signal baseband transmission system. The 
information generates an analog message waveform x(t), which 1s to be reproduced at 
the destination. We'll model the source as an ergodic process characterized by a mes- 
sage bandwidth W such that any sample function x(f) has negligible spectral content 
for |f| > W. The channel is assumed to be distortionless over the message bandwidth 
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Source Transmitter — Distortionless White noise Receiver Destination 
channel Gal F) = No/2 
N 
x(t) Xp(t) xp() -- np(B 
SUAE L O ppp e 20ta 
S Sp Sp | Sp+Np 
ee el 
Figure 9.4-2 Analog baseband transmission system with noise. 


so that xp() = Kx(t — tj), where K and t, account for the total amplification and time 
delay of the system. We'll concentrate on the contaminating effects of additive white 
noise, as measured by the system's signal-to-noise ratio at the destination. "m 

The average signal power generated at the source can be represented as S, B xL 
Since the channel does not require equalization, the transmitter and receiver merely 
act as amplifiers over the message band with power gains gr and gg compensating 
for the transmission loss L. Thus, the transmitted signal power, the received signal 
power, and the destination signal power are related by 


See Hes S etes E sev dx pus. 7] 


These three parameters are labeled at the corresponding locations in Fig. 9.4—2. 
The figure also shows a lowpass filter as part of the receiver. This filter has the 
crucial task of passing the message while reducing the noise at the destination. Obvi- 
ously, the filter should reject all noise frequency components that fall outside the mes- 
sage band—which calls for an ideal LPF with bandwidth B = W. The resulting desti- 
nation noise power will be Np = ggNoW, obtained from Eq. (5) with By = B = W. 


We now see that the receiver gain gg amplifies signal and noise equally. There- 
fore, gg cancels out when we divide Sp by Np, and 


(S/N)p = Sg/ NQW [8] 


This simple result gives the destination signal-to-noise ratio in terms of three funda- 
mental system parameters: the signal power Sz and noise density Nọ at the receiver 
input, and the message bandwidth W. We can also interpret the denominator N9W as 
the noise power in the message band before amplification by gg. Consequently, a 
wideband signal suffers more from noise contamination than a narrowband signal. 

For decibel calculations of (S/N)p, we'll express the signal power in milliwatts 
(or dBm) and write the noise power in terms of the noise temperature F y. Thus, 


S n" SR Fy 
N = 10 logio IW Sa. t 174 — 10 logio a," 19] 


where the constant 174 dB comes from Eq. (6) converted to milliwatts. 
Table 9.4—1 lists typical dB values of (S/N), along with the frequency range 
needed for various types of analog communication systems. The upper limit of the 
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frequency range equals the nominal message bandwidth W. The lower limit also has 
design significance because many transmission systems include transformers or 
coupling capacitors that degrade the low-frequency response. 





Table 9.4-1 Typical transmission requirements for selected analog signals 
Signal Type Frequency Range Signal-to-Noise Ratio, dB 
Barely intelligible voice 500 Hz to 2 kHz 5-10 

Telephone-quality voice 200 Hz to 3.2 KHz 25-35 

AM broadcast quality audio 100 Hz to 5 kHz 40-50 

High-fidelity audio 20 Hz to 20 kHz 55-65 

Video 60 Hz to 4.2 MHz 45-55 


The destination signal-to-noise ratio doesn’t depend on the receiver gain, which 
only serves to produce the desired signal level at the output. However, (S/N) p will be 
affected by any gains or losses that enter the picture before the noise has been added. 
Specifically, substituting Sg = SL in Eq. (8) yields 


(S/N)p = Sz/LN,W [10] 


so (S/N)p is directly proportional to the transmitted power Sy and inversely propor- 
tional to the transmission loss L—a rather obvious but nonetheless significant con- 
clusion. ; 

When all the parameters in Eq. (10) are fixed and (S/N)p turns out to be too 
small, consideration should be given to the use of repeaters to improve system per- 
formance. In particular, suppose that the transmission path is divided into equal sec- 
tions, each having loss Z4. If a repeater amplifier with noise density No is inserted at 
the end of the first section, its output signal-to-noise ratio will be 


(S/N), = Sy/ LuNQW 


which follows immediately from Eq. (10). Repeaters are often designed so the 
amplifier gain just equals the section loss. The analysis in the appendix then shows 
that if the system consists of m identical repeater sections (including the receiver), 
the overall signal-to-noise ratio becomes 


Eres) tar Con] mi 
N/]p M\N/, mLINLNW 
Compared to direct transmission, this result represents potential improvement by a 
factor of L/mL;. 

It should be stressed that all of our results have been based on distortionless 
transmission, additive white noise, and ideal filtering. Consequently, Eqs. (8)-(11) 


represent upper bounds on (S/N)p for analog communication. If the noise band- 
width of the lowpass filter in an actual system is appreciably greater than the 
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message bandwidth, the signal-to-noise ratio will be reduced by the factor W/By. 
System nonlinearities that cause the output to include signal-times-noise terms also 
reduce (S/N). However, nonlinear companding may yield a net improvement. 





EXAMPLE 9.4-1 Consider a cable system having L = 140 dB = 10! and Jy = 59 ,. If you want 
high-fidelity audio transmission with W = 20 kHz and (S/N)p = 60 dB, the neces- 





aiti dst cuu lag 







sary signal power at the receiver can be found from Eq. (9) written as 
Sp,,, + 174 — 10 logyo(5 X 20 x 10°) = 60 dB 

Hence, Sp = —64 dBm = 4 X 107" mW and the transmitted power must be Sy = 
LSp = 4 X 10? mW = 40,000 W. 3 

Needless to say, you wouldn't even try to put 40 KW on a signal transmission E 
cable! Instead, you might insert a repeater at the midpoint so that L, = 70 dB = 10". 3 
(Recall that cable loss in dB is directly proportional to length.) The resulting 4 
improvement factor in Eq. (11) is a 


L 1014 


—— = ———— = x 10$ 


which reduces the transmitted-power requirement to $7 = (4 X 107 mW)/(5 x 109) = 
8 mW—a much more realistic value. You would probably take $7 in the range of 
10—20 mW to provide a margin of safety. 








EXERCISE 9.4-1 Repeat the calculations in Example 9.4~1 for the case of video transmission with W = 
4.2 MHz and (S/N)p = 50 dB. 








9.5 BASEBAND PULSE TRANSMISSION WITH NOISE 


This section looks at baseband pulse transmission with noise, which differs from 
analog signal transmission in two major respects. First, rather than reproducing a 
waveform, we're usually concerned with measuring the pulse amplitude or arrival 
time or determining the presence or absence of a pulse. Second, we may know the 
pulse shape in advance, but not its amplitude or arrival time. Thus, the concept of 
signal-to-noise ratio introduced in Sect. 9.4 has little meaning here. 


Pulse Measurements in Noise 


Consider initially the problem of measuring some parameter of a single received 
pulse p(f) contaminated by additive noise, as represented by the receiver dia- 
grammed in Fig. 9.5-1a. Let the pulse be more-or-less rectangular with received 
amplitude A, duration 7, and energy E, — A?r. Let the noise be white with power 
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() LPF 
ZO gi ie y(t) = p(t) + nls) 





| 2T 
G(f) = Ny2 
la] 
v(r) 
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A 
t 
eens dp qe 
(b) 
Figure 9.5-1 Pulse measurement in noise. (a] Model; (b) waveform. 


spectral density G(f) = N,/2 and zero mean value. The pulse will be passed and the 
excess noise will be removed by a reasonably selective lowpass filter having unit 
gain and bandwidth B ~ By = 1/27. Thus, the output y(t) = p(t) + n(t) sketched in 
Fig. 9.5-1b consists of noise variations superimposed on a trapezoidal pulse shape 
with risetime f, ~ 1/2By. 

If you want to measure the pulse amplitude, you should do so at some instant f, 


near the center of the output pulse. A single measurement then yields the random 
quantity 


y(ta) SAT n(ta) SAt €A 
where e, = n(t,) represents the amplitude error. Thus, the error variance is 


which should be small compared to A? for an accurate measurement. Since A? — 
E,/7 and By = 1/27, we can write the lower bound 
No NA 


02 z— [2] 


2T 2E, 


Any filter bandwidth less than about 1/27 would reduce the output pulse amplitude 
as well as the noise. Achieving the lower bound requires a matched filter as dis- 
cussed later. 

Measurements of pulse arrival time or duration are usually carried out by detect- 
ing the instant £, when y(t) crosses some fixed level such as A/2. The noise perturba- 
tion n(t,) shown in the enlarged view of Fig. 9.5-2 causes a time-position error €, 
From the similar triangles here we see that e,/n(t;) = t,/A, so e, = (t,/A)n(t,) and 


o? = (t/Ay r? s (t,/A)° NoBy 
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Figure 9.5-2 Time-position error caused by noise. 


Substituting t, = 1/2By and A? = E,/7 yields 


2 No Not 


~ - [3] 
Ot AByA? 4By E, 








which implies that we can make g, arbitrarily small by letting By — © so that t, > 0. 


But the received pulse actually has a nonzero risetime determined by the transmis- 
sion bandwidth Br. Hence, 





D No = Not 
o; = z= [4] 
4B;A?  AB,E, 








and the lower bound is obtained with filter bandwidth By ^ B;—in contrast to the 
lower bound on c, obtained with By = 1/27. 





EXERCISE 9.5-1 Suppose a 10-ys pulse is transmitted on a channel having By = 800 kHz and Ny = 
E,/50. Calculate o4/A and o /r when: (a) By = 1/27; (b) By = Br. 
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Pulse Detection and Matched Filters 


When we know the pulse shape, we can design optimum receiving filters for detect- 
ing pulses buried in additive noise of almost any known spectral density G,(f). Such 
optimum filters, called matched filters, have extensive applications in digital com- 
munication, radar systems, and the like. 

Figure 9.5-3a will be our model for pulse detection. The received pulse has 
known shape p(t) but unknown amplitude A, and arrival time fo, so the received sig- 


nal is 
x«t) = App(t — to) (Sal 
Thus, the Fourier transform of xp() will be 
Xf) = ApP(fye "^ [5b] 


vapor 
: » 
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Figure 9.5-3 Pulse detection in noise. (a) Model; (b] filtered output. 


where P(f) = &[p(d)]. The total received pulse energy is 


B= | pupa ai OR a 
65 s 
To detect the pulse in the face of additive noise, the receiving filter should compress 
the pulse energy into an output pulse with peak amplitude A at some specific instant, 
say f = fy + ta and the filter should also minimize the rms output noise. The output 
waveform would thus look something like Fig. 9.5-3b. We seek the filter transfer 
function H(f) that achieves this goal, given p(t) and G,(f). 
First, we write the peak pulse amplitude in terms of H(f) and P(f), namely, 


eo 


A= FTAA E - al H(f)P(f)e* df 6] 


Next, assuming that the noise has zero mean, its output variance is 


whe | (A? GC) df 


We want to maximize the ratio of peak output amplitude A to rms noise © or, 
equivalently, 





|| moe df 


AN - 
(4) = AZ i [7] 


|H(F)|? GCF) df 





where H(f) is the only function at our disposal. Normally, optimization requires the 
methods of variational calculus. But this particular problem (and a few others) can 
be solved by adroit application of Schwarz’s inequality. 


389 


390 CHAPTER? € Random Signals and Noise 


For our present purposes we draw upon the inequality from p. 128 in the form 





| : vewa . 
P.we < |. WF) |? df 


where V(f) and W(f) are arbitrary functions of f. The left-hand side of this inequal- 
ity is identical to Eq. (7) with 


Wf) = HA) VG) 
AH(f)P(f)e" — A,P(f els 
VS) V G(f) 


and the inequality becomes an equality when V(f) is proportional to W(f). There- 
fore, if V(f) = KW(fy/Ag, then we obtain the maximum value 


wef) = 





AX ; Í » |P(f)|? 
M = 8 
ONE Ly * = 
The corresponding optimum filter must have 
P*( fje 
H, = Kee [9] 
aU D 


where K is an arbitrary gain constant. Observe that |H,,.(f)| is proportional to |P(f)| 
and inversely proportional to G,(f). Hence, the optimum filter emphasizes those fre- 
quencies where the pulse spectrum is large and deemphasizes those frequencies 
where the noise spectrum is large—-a very sensible thing to do. 

In the special but important case of white noise with G,(f) = Ny2, Eq. (8) 


reduces to 
AV 2A; [ l 2E, 
E = P 2 df = — [10] 
(5). N, a (f)|? af N, 


which brings out the significance of the energy E, for pulse detection. The impulse 
response of the optimum filter is then 


2K ; 2K 
holt) = g| PH f jee = — p(t — t) [11] 
: No NU 
The name matched filter comes from the fact that /,,,(t) has the same shape as the 
pulse p(t) reversed in time and shifted by t,. The value of t; equals the delay between 
the pulse arrival and the output peak, and it may be chosen by the designer. 
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Sometimes the matched filter turns out to be physically unrealizable because 
hyy(f) has precursors for t < 0. However, a reasonable approximation may be 
obtained by taking ¢, large enough so that the precursors are negligible. 





Consider the rectangular pulse shape p(t) = u(t) — u(t — 7). Find h,4(t) from Eq. 
(11) when K = Ny2, and determine the condition on t; for realizability of the 


matched filter. Then use the convolution A, (2) x«(t) to obtain the peak value A of the 
output pulse. 


EXERCISE 9.5~2 





9.6 PROBLEMS 


9.1-1* The random variable X has a uniform distribution over 0 = x S 2. Find v(t), 
R,(t,,t2), and v(t) for the random process u(t) = 6e*'. 


9.1-2 Do Prob. 9.1--1 with u(t) = 6 cos Xt. 


9.1-3 Let X and Y be independent RVs. Given that X has a uniform distribution over —1 = 
x = 1 and that Y = 2 and Y? = 6, find v(t), R,(4,t,), and v*(r) for the random 
process v(t) = (Y + 3Xpt. 

9.1-4 Do Prob. 9.1-3 with v(r) = Ye*. 

9.1-5 Do Prob. 9.1-3 with v(t) = Y cos Xt. 


9.1-6t Let v(t) = A cos (2ar Ft + ®) where A is a constant and F and ® are RVs. If b has a 
uniform distribution over 27 radians and F has an arbitrary PDF p;(f), show that 


AT 
| R,(t), t3) = = | cos 2v A(t, — t5)pg(A) dÀ 


Also find u(t) and v(t), 


| 9.1-7* Let X and Y be independent RVs, both having zero mean and variance o?. Find the 
crosscorrelation function of the random processes 


| v(t) = X cos wot + Y sin wot 


w(t) = Y cos wot — X sin wot 


| 9.1-8 Consider the process v(t) defined in Prob. 9.1-7. (a) Find v(t) and R,(t,,t2) to con- 


firm that this process is wide-sense stationary. (b) Show, from E[v?(7)] and «v2(2)7, 
that the process is not ergodic. 


| i 9.1-9 Let v(t) = A cos (wot + P), where A and ® are independent RVs and € has a uni- 


form distribution over 27 radians. (a) Find v(t) and R,(t;,t,) to confirm that this 


process is wide-sense stationary. (b) Show, from E[v?()] and <v (A>, that the 
| process is not ergodic. 
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9.1-11 
9.2-1* 


9.2-2 
9.2-3 
9.2-4 
9.2-5t 


9.2-6 


9.2-7 


9.2-8 
9.2-9 
9.2-10 
9.2-11 
9.2-12* 
92-13 


9.2-14 


9.2-15t 


CHAPTER? © Random Signals and Noise 


Let z(t) = u(t) — v(t + T), where v(t) is a stationary nonperiodic process and T is a 
constant. Find the mean and variance of z(t) in terms of R,(r). 


Do Prob. 9.1-10 with z(t) = v(t) + v(t — T). 


A certain random signal has R,(7) = 16e 0? + 9, Find the power spectrum and 
determine the signal's dc value, average power, and rms value. 


Do Prob. 9.2-1 with R,(7) = 32 sinc? 8T + 4 cos 87. 


Consider the signal defined in Prob. 9.1-6. Show that G,(f) = (A7/4)[p-(f) + 


pr(—f)]. Then simplify this expression for the case when F = fo, where fo is a 
constant. 


Consider the spectral estimate Gf) = |Vr(f, s) /T. Use Eq. (8), p. 365, to show 
that E[G,(f)] = (T sinc? f T) * G,( f) What happens in the limit as T — co? 


Let v(t) be a randomly phased sinusoid. Show that Eq. (7), p. 364, yields G,(f) in 
Eq. (4), p. 363. 


Modify Eqs. (115)-(114d), p. 367, for the case when v(t) and w(t) are independent 


stationary signals with mymy # 0. Show from your results that R, (+) = (my + my}? 
and that z? > 0. 


Let v(t) and w(t) be jointly stationary, so that R,y(t,,t.) = Ry(t, — h). Show that 
Ry(r) = RT) 
What’s the corresponding relationship between the cross-spectral densities? 


Let z(t) = v(t) — v(t + T), where v(t) is a stationary random signal and T is a con- 
stant. Start with R.(t,,45) to find R,(7) and G,(f) in terms of R,(7) and G,(f). 


Do Prob. 9.2-8 with z(t) = v(t) + v(t — T). 


Let z() = A cos Qarfit + bj) cos Qarfat + Bz), where A, f, and f, are constants, and 
G, and ®, are independent RVs, both uniformly distributed over 27 radians. Find 
Gf) and simplify it for the case fi = f;. 


Confirm that R(T) = A(—7) * Ry,(7). 


Let y(t) = dx(t)/dt. Find R,(r) and R,,(7) in terms of R(T) by taking the inverse 
transforms of G,(f) and G,,(/f). 


Let y(t) = x(t) — ax(t — T), where œ and T are constants. Obtain expressions for 
G,(f) and R,(7). 


Use the relation in Prob. 9.2-7 to show that 
R(T) =a -Ê (7) 
The moving average of a random signal x(ż) is defined as 


1 t+T/2 
y(t) = = | x(A) dar 
T 1-7/2 
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Find H(f) such that y(t) = A(t)'x(t) and show that 


R,(r) = - i (1 - P jue ~- A)dd 


Derive Eq. (2b), p. 373, by using series approximations in Eq. (2a), p. 373. 


Use Eq. (17D), p. 369, to show that Eq. (9b), p. 376, can be written in the alternate 
form 


No 


Rin) = le h(t) h(t + 7)dt 


Find G,(f), R(T), and y? when white noise is filtered by the zero-order hold on 
p. 241. 
Do Prob. 9.3-3 with a gaussian filter having H(f) = Ke" (4. 


Do Prob. 9.3-3 with an ideal BPF having gain K and delay t, over the freguegey 
range fy — B/2 x |f| = fy + B2. 


Do Prob. 9.3—3 with an ideal HPF having gain K and delay fy over the frequency 
range |f] = fo. 


Figure P9.3—7 represents a white-noise voltage source connected to a noiseless RL 


. network. Find G,(f), Ry(7), and y? taking y(2) as the voltage across R. 


R 


GG) = No2 e AL 
Figure P9.3-7 


Do Prob. 9.3-7 taking y(t) as the voltage across L. 


The spectral density of the current i(7) in Fig. P9.3—7 is G(f) = No,/(2|R + joL). If 
the source represents thermal noise from the resistance, then the equipartition the- 
orem of statistical mechanics requires that Li? = kJ. As a consequence, show 
that No, = 4RKZ , in agreement with Eq. (7), p. 376. 


Thermal noise from a 10 KQ resistance at room temperature is applied to an ideal 
LPF with B — 2.5 MHz and unit gain. The filtered noise is applied to a full-wave 
rectifier, producing z(t) = |y(2)|. Calculate the mean and rms value of z(f). 


Do Prob. 9.3~10 with a half-wave rectifier, so that z( = O for y(t) < 0. 
Thermal noise from a 10 KÊ resistance at room temperature is applied to an ideal 
LPF with B = 2.5 MHz and unit gain. The filtered noise voltage is then applied to a 


delay line producing z(t) = y(t — T). Use Eqs. (14) and (15), p. 344, to find the joint 
PDF of the random variables Y = y(t) and Z = z(t) when T = 1 ps. 
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Do Prob. 9.3-12 with T = 0.1 us. 
Find By for the gaussian LPF in Prob. 9.3-4. Compare By with the 3-dB bandwidth B. 


Impulse noise, which occurs in some communication systems, can be modeled as 
the random signal 


x)= Y AB - T) 


k--oco 


where the A, and T, are independent sets of random variables. The impulse weights 
A, are independent and have zero mean and variance o?. The delay times T, are gov- 
emed by a Poisson process such that the expected number of impulses in time T 
equals uT. Use Eq. (7), p. 364, to show that impulse noise has a constant spectral 
density given by G,(f) = wo. 

Calculate (S/N), in dB for a baseband system with Fy = To, W = 4 MHz, and Sp = 
0.02 uW. 


Calculate (S/N)p in dB for a baseband system with Ty = 599, W = 2 MHz, and 
Sg = 0.004 uW. 


A baseband analog transmission system with W = 5 kHz has (S/N), = 46 dB when 
Sr = 100 mW. If the receiver bandwidth is changed accordingly, what value of S; is 
appropriate to: (a) upgrade the system for high-fidelity audio; (b) downgrade the 
system for telephone-quality voice? 


Consider an AWGN baseband transmission system with W = 10 kHz. Express 
(S/N)p in a form like Eq. (8), p. 384, when the receiving filter is: (a) a first-order 
LPF with B = 15 kHz; (b) a second-order Butterworth LPF with B = 12 kHz. 


Consider a baseband transmission system with additive white noise and a distorting 
channel having |H-(f)|? = 1/ {L[1 + (f/Wy]). The distortion is equalized by a 


receiving filter with |H,(f)|? = [K/\H¢(f)|PM(f/2W). Obtain an expression for 
(S/N) p in the form like Eq. (10), p. 385. 


Do Prob. 9.4-5 with |Ho(f)|? = {LU + QffWy). 


A baseband signal with W = 5 kHz is transmitted 40 km via a cable whose loss is a = 
3 dB/km. The receiver has Ty = 1075. (a) Find Sy needed to get (S/N)p = 60 dB. 
(b) Repeat the calculation assuming a repeater at the midpoint. 


A cable transmission system with L = 240 dB has m = 6 equal-length repeater sec- 
tions and (S/N)p = 30 dB. Find the new value of (S/N)p if: (a) m is increased to 12; 
(b) m is decreased to 4. 


The cable for a 400-km repeater system has a = 0.5 dB/km. Find the minimum 


number of equal-length repeater sections needed to get (S/N) p = 30 dB if S/NoW = 
80 dB. 


If all other parameters of a cable transmission system are fixed, show that the num- 
ber of equal-length repeater sections that maximizes (S/N)p is m = 0.23L. 
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The open-circuit signal voltage output of an oscillator is A cos 27/5. The oscillator 
has a noiseless source resistance R and internally generated thermal noise at temper- 
ature Jy. A capacitance C is connected across the output terminals to improve the 
signal-to-noise ratio. Obtain an expression for S/N in terms of C, and find the value 
of C that maximizes S/N. 


A baseband pulse transmission system has By = 1/7, Ty = To, and E, = 107” J. 
Find (o4/A)?. 


Show that c; /t, = o4/A = V NoByr/ E, when By = Br. 


A baseband pulse transmission system has 7 = 5 us, By = 1 MHz, and Ny = 10? 
W/Hz. Find the minimum value of E, so that @4 = A/100, and calculate the corre- 
sponding value of o/7. 


A baseband pulse transmission system has 7 = 50 us, Br = 1 MHz, and Ny = 107” 
W/Hz. Find the minimum value of E, so that o, = 7/100, and calculate the corre- 
sponding value of a ,/A. 


A baseband pulse transmission system has 7 = 1 ms, By = 1 MHz, and No = 107". 
Find the minimum value of £, and the corresponding value of By so that o4 = A/100 
and c, = 7/1000. 


A rectangular pulse with duration 7 has been contaminated by white noise. The 
receiving filter is a first-order LPF with 3-dB bandwidth B > 1/7, rather than a 
matched filter. Obtain an expression for (A/o)? in terms of EJN,. 


Do Prob. 9.5—6 for arbitrary B. 


Let the shape of a received pulse be p(t) = A(t/7). Find the characteristics of the 
matched filter assuming white noise. Then find the condition on /, so that this filter 
can be realized. 


Let the shape of a received pulse be p(t) = e u(t). Find the characteristics of the 
matched filter assuming white noise. Then find the condition on t, so that a good 
approximation of this filter can be realized. 
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his chapter rounds out our study of analog modulation with an examination of system performance in the presence 
of contaminating noise. Our emphasis will be on CW modulation systems, but we'll also look at analog pulse 
modulation. 

We'll begin with the properties of the bandpass noise that appears in CW modulation systems. The assumption 
of bandpassfiltered white noise from a stationary gaussian process leads to mathematical descriptions that we 
employ to investigate additive noise effects in linear and exponential modulation systems. Our work culminates in a 
relative comparison of the several types of CW modulation. Then we'll examine the effects of additive noise on 
phasedock loops and analog pulse modulation. 


Further comparisons including both CW and pulse modulation will be made in Chap. 16, based upon absolute 
standards derived from information theory. 


OBJECTIVES 


After studying this chapter and working the exercises, you should be able to do each of the following: 


1. Sketch the noise power spectrum at the input of a CW demodulator (Sect. 10.1). 


2. Write the quadrature and envelope-and-phase expressions for bandpass noise, and relate the components to the 
power spectrum (Sect. 10.1). 


3. Write expressions for the predetection and postdetection signal-plus-noise when a CW modulation system has 
additive noise (Sects. 10.2 and 10.3). 


4. Sketch the noise power spectrum and calculate the noise power at the output of a CW demodulator with input 
bandpass noise (Sects. 10.2 and 10.3). 


5. Explain the meaning and significance of threshold effect, deemphasis improvement, and wideband noise reduc- 
tion (Sect. 10.3). 


6. Calculate (S/N)p and the threshold level, if any, for a CW modulation system with specified parameters (Sects. 
10.2 and 10.3). 


7. Select a suitable analog CW modulation type, given the desired system performance and constraints (Sect. 10.4). 
Describe the effects of additive noise on phase-lock loop performance (Sect. 10.5). 

9. Determine suitable parameters for a PAM, PDM, or PPM system given the desired (S/N)p (Sect. 10.6). 
10. Explain the meaning and significance of false-pulse threshold effect in PDM or PPM (Sect. 10.6). 
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10.1 BANDPASS NOISE 


Although we must give individual attention to the effects of noise on specific types of 
analog modulation, all analog CW communication systems have the same general 
structure and suffer from bandpass noise. This section summarizes the system mod- 
els and describes bandpass noise, assuming that the noise comes from an additive 
white gaussian noise (AW GN) process. In particular, we'll state the statistical proper- 
ties of the quadrature components and the envelope and phase of bandpass noise. The 
stated properties are subsequently justified by studying the correlation functions. 








10.1 Bandpass Noise 


System Models 


Figure 10.1—1 depicts our generalized model of an analog CW communication sys- 
tem. The message x(t) is a lowpass signal coming from an ergodic process with 
bandwidth W, normalized such that 





(t)| <1 S, =x? = <x%(N)><1 


All message signal averages will be taken in the statistical sense over the ensemble. 

The channel has transmission loss L but otherwise provides nearly distortion- 
less transmission and negligible time delay. (A distorting channel would require 
equalization by a bandpass device located before the detector because demodulation 
is a nonlinear operation—except for synchronous detection which permits baseband 
equalization.) To minimize notational changes, the modulated signal at the output of 


the channel will be represented by x,(¢) with carrier amplitude A.. The received sig- 
nal power is then 


S= =x? : [1] 


The corresponding transmitted waveform is VLx,(t), sO our previous expressions 
for Sy still apply but with A, replaced by VLA.. 

We’ll model the entire predetection portion of the receiver as a bandpass filter 
with transfer function Hp(f) having unit gain over the transmission bandwidth By. 
We can ignore any predetection frequency translation since it has the same effect on 
signal and noise— which is also true for any predetection gain. Thus, in the usual 


case of superheterodyne receiver, Hp(f) is the frequency response of the IF ampli- 
fier with frg = f. 


Under the foregoing conditions with the assumption of additive noise at the 
receiver's input, the total signal-plus-noise at the detector becomes 


v(t) = x(t)  n(t) [2] 


where n(t) represents the predetection noise. Eventually, we'll recast v(?) in envelope- 
and-phase or quadrature carrier form as 


u(t) = A,(t) cos [ext + o,(t)] = vi(t) cos wt — v(t) sin wet 


Noise: G( f) = No/2 
Receiver 
NL x(t) 


| 
on prn BPF pet | LPF 
Sr L ARF) Hp 

Sp=x? 


v(t) = x(t) + 22) 





Figure 10.1-1 Model of a CW communication system with noise. 
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which facilitates analysis of the demodulated signal-plus-noise y(t). Lastly, the 
demodulated waveform goes through a lowpass filter with Hp(f) to yield the final 
output yp(f) at the destination. This postdetection filtering may include deemphasis 
or other processing operations. 

The additive form of Eq. (2) together with the reasonable assumption of statis- 
tically independent signal and noise allows us to write 


V? = x2 +n? = Sg + Np [3] 


where Mg = n? is the predetection noise power. Before discussing the signal-to- 
noise ratio, we'll turn off the signal and examine Ne by itself. 

Figure 10.1—2a isolates that part of the system diagram relevant to Ng. Here, we 
treat the channel noise plus any noise generated in the predetection portion of the 
receiver as being equivalent to white noise. Hence, the filtered output n(t) has spec- 
tral density 


Gf) = PRAI 


as sketched in Fig. 10.1—-2). The density parameter Np includes all noise referred to 
the input of the receiver. We then say that 


Figure 10.1-2b is based upon a predetection filter with nearly square frequency 
response, so its noise bandwidth equals B7 and 





N, 
GU) = 3) HRS) 









N; 
yg fi 
ina 
0 
-Je Jet By Je-aBr fe 
(b) 
Figure 10.1-2 Bandpass Filtered white noise. (a) Block diagram; [b) power spectrum. 





10.1 Bandpass Noise 


Ng = | G,f)df = NoBr (4] 
A less selective filter would, of course, pass more noise power. 

Note that the carrier frequency f. does not necessarily fall in the center of the 
passband in Fig. 10.1-2b. We've written the lower cutoff as f. — aB; to include the 
symmetric-sideband case (o = 1/2) and the suppressed-sideband case (a = 0 or 1). 
Needless to say, the value of By depends upon the message bandwidth W and the 
type of modulation. 


From Eqs. (3) and (4), we now define the predetection signal-to-noise ratio 


(aii s 
N/p Ng NoBr 
which looks similar to the destination signal-to-noise ratio Sg/NọW defined in Sect. 


9.4 relative to baseband transmission. But (S/N)p should not be confused with 
Sp/NoW. To bring out the distinction, we'll introduce the system parameter 





y = S4 NW [6] 
such that 
(5) .¥ n 
Nie EST 


and hence (S/N)p = y since Br = W. You should keep in mind the interpretation that 
y equals the maximum destination S/N for analog baseband transmission with iden- 
tical values of Sp and No at the receiver. By the same token, Eqs. (5) and (7) are actu- 
ally upper bounds on (S/N), since the various imperfections in a practical system 
inevitably degrade the signal-to-noise ratio to some extent. 


Quadrature Components 
Now let n(f) be a sample function of an AWGN process. Then 


E oT S 
n=0 n = wv = Ne 


which follows from the absence of a dc component in G,(f). And the shape of G,(f) 
in Fig. 10.1-2b suggests expressing the noise in the usual bandpass form 


n(t) = nt) cos ext — n,(t) sin «xt [8] 


with in-phase component nt) and quadrature component 7,(f). These components 
are jointly stationary and gaussian, hke n(t), and have the following properties: 


m-n-0 n()n-o [9a] 


WS ne n= Ny [9b] 
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Figure 10.1-3 Lowpass power spectra of the quadrature components of bandpass noise. 





(a) General case; (b) symmetric-sideband case; (c) suppressed-sideband case. 


Equation (9a) means that the random variables n,(t) and n,(f) are independent at any 
instant f, so their joint probability density function is the product of identical gauss- 
ian PDF’s. 

The power spectral densities of the quadrature components are identical low- 
pass functions related to G,( f) by 


Gf) = GF) = Gf + fut + fe) + GF - AL -— uf -f)] ol 


where the term G,(f + f.)u(f + f.) simply stands for the positive frequency portion : 
of G,( f) translated downward and G,Cf — f.) [1 — uCf — f] stands for the negative- 
frequency portion translated upward. These terms then overlap and add for |f| = 
&B;, as illustrated by Fig. 10.1—3a. Figure 10.1—3b shows the complete overlap in 
the symmetric-sideband case (a = 1/2), and Fig. 10.1-3c shows the lack of overlap 
in the suppressed-sideband case (a = O or 1). When G,(f) has local symmetry 
around +f, as reflected in Fig. 10.1-3b, the quadrature components are uncorre- 
lated processes. 


EXERCISE 10.1-1 





for f > 0. 
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f-fo, MHz 





Figure 10.1-4 


Envelope and Phase 
As an alternative to Eq. (8), we also want to express bandpass noise in the form 
n(t) = A,(t) cos [wt + ¢,(t)] uu 


with envelope A,(t) and phase $,(t). The standard phasor diagram in Fig. 10.1—5 
relates our two sets of components. Clearly, at any instant of time, 


n 
A?-—g2-c a; P, = tan! = [12a] 
H 
and conversely 
n; = A, cos $, ng =A, sin >, [125] 


These nonlinear relationships make spectral analysis of A, and 4$, difficult, even 
though we know G, ( f) and G,,,( f). However, the lowpass spectrum of the quadra- 
ture components suggests that the time variations of A,(¢) and $,(f) will be slow 
compared to f., in agreement with the bandpass nature of n(¢). 


Furthermore, Eq. (12a) constitutes a rectangular-to-polar conversion of inde- 


pendent gaussian RVs, just like the one that led to the Rayleigh distribution. We thus 
conclude that the PDF of the envelope is a Rayleigh function given by 


An FA 
Pa lân) = T ^^t u(A,.) [13] 
R 





| n 


Figure 10.1-5 Phasor diagram for bandpass noise components. 
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with mean and second moment 
A, = Va«N,2 A = 2Np 


The probability that A, exceeds some specified positive value a is then 


PI ee) zi 
lb Mu ia ast E 


P(A, > a) Se M [14] 


OPITE LUN E 


These results follow from Eqs. (11)-(13) in Sect. 8.4. 
The phase $, has a uniform PDF over 277 radians, independent of A,. Hence, 





n^ = A? cos (ua,t + pa) = A2 X 1/2 = Ng 
which explains the factor of 2 in A2 = 2Np. 





EXERCISE 10.1-2 


Suppose bandpass noise with Ng = 1 uW is applied to a one-ohm resistor. Calculate 
the mean and rms value of the envelope voltage. Also evaluate P(A, > 2A,). 





iactu town e jid a aan MIT ab dat adnan 


Correlation Functions* . 1 


The properties of the quadrature components of bandpass noise were presented 
without proof in order to put the important results up front. Now we'll outline the 
derivation of those results by drawing upon various correlation functions. This 


analysis brings together concepts and relations from several previous chapters to 
shed further light on bandpass noise. 


We begin with the fictitious lowpass equivalent noise waveform defined by 
A 2 m 
nep(t) = a[n(t) + JAE) Je 


in which A(t) is the Hilbert transform of the bandpass noise n(t). The lowpass nature 


of ng,(t) is easily confirmed by deterministic Fourier transformation. But the quad- 
rature components of (t) should be such that 


M) + jng(t)) = na) 
Thus, equating the real and imaginary parts of ng,(t) yields 
nt) = n(t) cos ot + A(t) sin ot [15a] 
nj(t) = A(t) cos wt — n(t) sin «t [155] 


which establishes explicit relationships between the quadrature components and n(t). 
This expression contains much valuable information, as follows: 


It states the physically obvious fact that n,(t) and n,(¢) depend entirely on n(t)— 
remember that 7(t) represents a linear operation on n(t). 


If n(t) is gaussian then A (f) is gaussian, and since Eq. (15) shows that the quad- 


rature components are linear combinations of gaussian RVs at any instant, they 
must also be gaussian. 





10.1 Bandpass Noise 


3. Equation (15) provides the starting point for correlation analysis. 


4. Equation (15) brings out the importance of Hilbert transforms in the study of 
bandpass noise. 


The Hilbert transform of a random signal was previously considered in Exam- 
ple 9.2-3. Applying those results to the case at hand, we have 


G;(f) = GA) R(T) = R,(T) [16a] 
and 
Ra(7) = R (T) |^ Ra(r) = —R,(7) [165] 


Here, R,() stands for the Hilbert transform of R,(7), defined by R,(7) = 
ho(t)*R,(7) with hg(t) = lac. 


Having completed the necessary groundwork, we proceed to the autocorrela- 
tion function of the in-phase component nt). Into the basic definition 


R,(t, t — 7) = Eln,(t)n(t — 7)] 
we insert Eq. (15a) and perform some manipulations to get 


R,{t,t — 7) = LR, (7) + R,(t) | cos wt + [R (7) = R,a(t) | sin wt 


+ [R (T) — Ra(r)] cos e,Qt — 7) [Ra (7) + Rpa(t)] sin @(2t — 7) 
This cumbersome expression then simplifies with the help of Eq. (16) to 
R,(t,t — 7) = R,(T) cos wT + R,() sin er 


which is independent of t. The same result holds for the autocorrelation of 7,(?). 
Thus 


Ri) = Raa) = R(T) COS wer + R, (T) sin «XT [17] 


so the quadrature components are stationary and have identical autocorrelation and 
spectral density functions. 


To obtain the power spectral density via Fourier transformation of Eq. (17), we 
note that 


FIRAT) cos wr] = GUS f) + GG + f)] 
Then, using the convolution and modulation theorems, 
S [R,()] = S[ho(r)] LR.(7)] = (7j senf) GC) 
and 


FIRT) sin out] = 9&[R, (7) cos (or — 7/2)] 


= —$[-j sen (£  £)G — 5] + $L-sen f+ £)G.U 9 £2] 
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Therefore, 
G,(f) = G,(f) = 3[1 + sen (f+ AIG f) 
ol sacr 340]04 =f 


which reduces to Eq. (10) because the first term vanishes for f < —f, whereas the 
second term vanishes for f > f.. 


Finally, a similar analysis for the crosscorrelation of the quadrature components 
produces 


Ran (T) = Riz) sin wT = R,(r) cos wT [18] 


and 


SUR (51 = AGATA Rous 27) GPa acf eq pe 


If G,(f) has local symmetry around f = +f, then the right-hand side of Eq. (19) 
equals zero for all f. This means that Rar) = 0 for all 7, so the quadrature com- 
ponents are uncorrelated and statistically independent processes. 








10.2 | LINEAR CW MODULATION WITH NOISE 


Now we're prepared to deal with the situation in Fig. 10.2-1. The linearly modu- 
lated signal x,(t) is contaminated by AWGN at the input tc E the dip Predetec- 
tion bandpass filtering produces v(t) = x,(t) + n(t) with x2 = Ss and n? = Ns so 


(3) Sg Sp W 
NJ, Ne NBr Bi 


'The bandpass noise can be expressed in quadrature form as 





n(t) = nt) cos wt — nj(t) sin w,t 


where ni = n = Np = NoBr. The demodulation operation will be represented by 
one of the following idealized mathematical models: 


v(t) Synchronous detector 
t) = -— 
ye) A,(t) — A, Envelope detector 
N 
G)- F 
| U(0) = x,G) +n) 





x«t) BPF xt LPF yp) 
Ss A Ü Bo (5) 
Nip 


Figure 10.2-1 Model of receiver for CW modulation with noise. 











10.2 Linear CW Modulation with Noise 


These models presuppose perfect synchronization, and so forth, as appropriate. The 
term A, = <A,(t)> reflects the dc block normally found in an envelope detector. A 
detection constant could be included in our models but it adds nothing in the way of 
generality. 

The questions at hand are these: Given x,(f) and the type of detector, what's the 
final signal-plus-noise waveform yp(t) at the destination? And if the signal and noise 
are additive at the output, what's the destination signal-to-noise ratio (S/N) pj? 


Synchronous Detection 


An ideal synchronous detector simply extracts the in-phase component of v(t). If the 
modulation is DSB, then x(1) = A, x(t) cos wt so 


u(t) = [A, x(t) + n,(t)] cos wt — n,(t) sin oz uU 


and y(t) = v(t) = A, x(t) + nt). Thus, if the postdetection filter approximates an 
ideal LPF with bandwidth W, 


yp(t) = Acx(t) + nt) [2] 


We see that the output signal and noise are, indeed, additive, and that the quadrature 
noise component 7,(t) has been rejected by the detector. 
Furthermore, if the predetection filter has a relatively square response with 


bandwidth By = 2W centered at f,, then the output noise power will take the shape of 
Fig. 10.1-3b. Hence, 


G, Cf) = NyHCf/2W) [3] 


which looks like lowpass-filtered white noise. Under these conditions, we don’t . 


need any postdetection filter beyond the LPF within the synchronous detector. 

Next we obtain the postdetection S/N from Eq. (2) by taking the mean square 
values of the signal and noise terms. Upon noting that Np = n? and Sp= 
Aix? = A2S,, whereas Sp = x? = A2S,/2, we get 








( S ) Sp 25g È ) 
— = — = = — [4a] 
N/p Np NoBr N Jg 
or, since B, = 2W, 
S S 
(=) => =y DSB [45] 
N/p | NyW 


Therefore, insofar as noise is concerned, DSB with ideal synchronous detection has 
the same performance as analog baseband transmission. 

You might have suspected a different result in view of the predetection noise 
power Ng = N38, = 2NoW. However, the signal sidebands add in a coherent fashion, 
when translated to baseband, whereas the noise sidebands add incoherently. The 
sideband coherence in synchronous detection of DSB exactly counterbalances the 
double-sideband noise power passed by the predetection filter. 
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The preceding analysis is readily adapted to the case of an AM signal x,(t) = 
A [l + x(t)] cos ct, in which we've taken u = 1 for simplicity. If the synchronous 
detector includes an ideal dc block, then yp(t) will be as given in Eq. (2) so Sp = 
A25, and Np = n?. But when we account for the unmodulated carrier power in Sp = 
A2(1 + $3/2 we find that Sp = 25,S4/(1 + S,) and 


(=) TE (=) za AM [5] 
Now LES NA TIS” 


This ratio is bounded by (S/N)p = y/2 since S, x 1. 

Full-load tone modulation corresponds to S, = 1/2 and (S/N)p = y/3, which is 
about 5 dB below that of DSB with the same parameters. More typically, however, 
S, = 0.1 and AM would be some 10 dB inferior to DSB. AM broadcasting stations 
usually combat this effect with special techniques such as volume compression and 
peak limiting of the modulating signal to keep the carrier fully modulated most of 
the time. These techniques actually distort x(¢). 

For SSB modulation (or VSB with a small vestige) we have x,(t) = 
(A,/2)[x(t) cos wt + x(t) sin of] with Bp = W and $4 = A2S,/4. Synchronous 
detection rejects the quadrature component of both the signal and noise, leaving 


yp(t) = 7A,x(t) + n,(t) [6] 


so Sp = A2S,/4 = Sp. Since f. falls at either edge of an ideal predetection filter, 
G,,(f) has the shape of Fig. 10.1—3c. Hence, 








GG) ~ gra) m 


and Np = n2 = NW. Therefore, 


$ S 
nA S = 8 
Q) (5), a EF E 


which shows that SSB yields the same noise performance as analog baseband or 
DSB transmission. 


Finally, consider VSB plus carrier. If the vestigial band is small compared to W, 
then the predetection and postdetection noise will be essentially the same as SSB. 


But the signal will be essentially the same as AM with all the information-bearing 
power in one sideband. Hence, 


(2) MESS (3) mM: VSB + C [9] 
Ae TARANA Seas 


assuming that Bp ^ W and w= 1. 


To summarize the results in Eqs. (2)-(9), we state the following general proper- 
ties of synchronously detected linear modulation with noise: 


1. 








The message and noise are additive at the output if they are additive at the 
detector input. 
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2. If the predetection noise spectrum is reasonably flat over the transmission band, 
then the destination noise spectrum is essentially constant over the message band. 


3. Relative to (S/N)p, suppressed-sideband modulation has no particular advantage 
over double-sideband modulation because the coherence property of double side- 
band compensates for the reduced predetection noise power of single sideband. 


4. Making due allowance for the "wasted" power in unsuppressed-carrier systems, 
all types of linear modulation have the same performance as baseband trans- 
mission on the basis of average transmitted power and fixed noise density. 


These statements presume nearly ideal systems with fixed average power. 

Comparisons based on peak envelope power indicate that SSB yields a postde- 
tection S/N about 3 dB better than DSB and 9 dB better than AM, assuming a rea- 
sonably smooth modulating signal. But SSB is inferior to DSB if the message has 
pronounced discontinuities causing envelope horns. 





Suppose the predetection filter for a USSB signal actually passes f, — W/4 < |f| = EXERCISE 10.2-1 


f. + W. Use Fig. 10.1—3a to sketch the postdetection noise power spectrum. Then 
show that (S/N)p will be about 1 dB less than the value predicted by Eq. (8). 





Envelope Detection and Threshold Effect 


Inasmuch as AM is normally demodulated by an envelope detector, we should 


examine how this differs from synchronous detection when noise is present. At the 
detector input we have 


v(t) = AJ1 + x(t)] cos wt + [ni(t) cos wt — n,(t) sin w,t] [10] 


where we're still taking u = 1. The phasor construction of Fig. 10.2-2 shows that 
the resultant envelope and phase are 





A): VIALI x) + (OF + POT T 
zx -i nt) 
$0) = ten ALE Fal] + m0 





A {1 +x] 


Figure 10.2-2 Phaser diagram for AM plus noise with (S/N}p >> 1. 
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Clearly, further progress calls for some simplifications, so let's assume that the sig- 
nal is either very large or very small compared to the noise. 
Taking the signal to dominate, say A? >> n?, then A,[1 + x(2) will be large 


compared to n(f) and n,(f), at least most of the time. The envelope can then be 
approximated by 


Aj) = AY x(0] + ni) n2 
which shows the envelope modulation due to noise, similar to interference modula- 
tion. An ideal envelope detector reproduces the envelope minus its dc component, so 


yp(t) = A,(t) — A, = Acx(t) + ni(t) 


which is identical to that of a synchronous detector. The postdetection S/N is then as 
previously given in Eq. (5). Likewise, Eq. (9) will hold for envelope detection of 
VSB +C. ie EM 

But bear in mind that these results hold only when A2 > n?. Since A2/n? is 
proportional to Sp/N By, an equivalent requirement is (S/N)z => 1. (There is no such 
condition with synchronous detection.) Thus, providing that the predetection signal- 
to-noise ratio is large, envelope demodulation in the presence of noise has the same 
performance quality as synchronous demodulation. 

At the other extreme, with (S/N)z << 1, the situation is quite different. For if 
A2 << n?, the noise dominates in a fashion similar to strong interference, and we 
can think of x,(7) as modulating n(z) rather than the reverse. To expedite the analysis 
of this case, n(t) is represented in envelope-and-phase form n(t) = A,(t) cos [ot + 
¢,(t)], leading to the phasor diagram of Fig. 10.2—3. In this figure the noise phasor is 
the reference because we are taking n(t) to be dominant. The envelope is then 
approximated by the horizontal component, so 


A,(t) = A,(t) + A1 + x(t)] cos e(t) [13] 
from which 
y(t) = A,(t) + Ax(t) cos b,(t) — A, [14] 
where À, = V'rNg/2. 


The principal output component is obviously the noise envelope A,(f), as 
expected. Furthermore, there is no term in Eq. (14) strictly proportional to the mes- 








A, [1 +x] cos n 


Figure 10.2-3 Phasor diagram for AM plus noise with (S/N}p 1. 
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sage x(t). Though signal and noise were additive at the input, the detected message 
term is multiplied by noise in the form of cos $,(r), which is random. The message 
is therefore hopelessly mutilated, and its information has been lost. Under these cir- 
cumstances, an output signal-to-noise ratio is difficult to define, if not meaningless. 

The mutilation or loss of message at low predetection signal-to-noise ratios is 
called threshold effect. The name comes about because 





With synchronous detection, the output signal and noise are always additive. True, 
the message is buried in noise if (S/N), << 1, but the identity of x(7) is preserved. 

Actually, the threshold is not a unique point unless some convention is estab- 
lished for its definition. Generally speaking, threshold effects are minimal if A, => 
A, most of the time. To be more specific we define the threshold level as that value 
of (S/N). for which A, = A, with probability 0.99. Then 


S 
= = 41 = I 
( ) n 10 « 10 [15a] 


or, since (S/N), = y/2, 


Yn = 81n 10 = 20 [155] 
If (S/N) pz < (S/N) an (Or y < yq), Message mutilation must be expected, along with 
the consequent loss of information. 


Looking at the value of (S/N)p, and recalling that (S/N)p < (S/N) p leads to a 
significant conclusion: 





For audio transmission demands a postdetection signal-to-noise ratio of 30 dB or 
more, so (S/N)g is well above the threshold level. In other words, additive noise 
obscures the signal long before multiplicative noise mutilates it. On the other hand, 
sophisticated processing techniques exist for recovering digital signals buried in 
additive noise. Hence, if AM is used for digital transmission, synchronous detection 
may be necessary to avoid threshold effects. 

Lastly, let’s consider how an envelope detector can act in a synchronous fashion 
and why this requires large (S/N). Assuming the input noise is negligible, the diode 
in an envelope detector functions as a switch, closing briefly on the carrier peaks of 
the proper polarity; therefore the switching is perfectly synchronized with the carrier. 
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But when noise dominates, the switching 1s controlled by the noise peaks and syn- 
chronism 1s lost. The latter effect never occurs in true synchronous detectors, where 
the locally generated carrier can always be much greater than the noise. 





EXERCISE 10.2-2 Use Eq. (14), p. 404, to derive the threshold level in Eq. (15a). Specifically, show 
that if S, =1, then P(A, = A,) = 0.99 requires (S/N), = 41n 10 = 10. 








10.3 EXPONENTIAL CW MODULATION WITH NOISE 


This section deals with noise in analog PM and FM systems. The demodulation 
operation will be represented by 


ot) Phase detector 
XE s 
in e) Frequency detector 


As we saw in Chap. 5, the inherent nonlinear nature of exponential modulation leads 
to analytic difficulties—all the more so when noise must be considered. We'll there- 
fore begin with the large signal condition (S/N), => 1 to determine the postdetection 
noise characteristics and signal-to-noise ratios for PM and FM. Our efforts here pay 
off in results that quantify the valuable wideband noise reduction property, a prop- 
erty further enhanced by postdetection FM deemphasis filtering. 

But wideband noise reduction involves a threshold effect that, unlike the AM 
case, may pose a significant performance limitation. We'll qualitatively discuss 


operation near threshold, and take a brief look at the FM feedback receiver as one 
technique for threshold extension. 


Postdetection Noise 


The predetection portion of an exponential modulation receiver has the structure 
previously diagrammed in Fig. 10.2—1 (p. 406). The received signal is 


x(t) = A, cos [od + $(t)] 
where f(t) = x(t) for a PM wave or f(t) = 2zfax(t) for an FM wave. In either 


case, the carrier amplitude remains constant so 


Sg = 34 (=) ee UJ 
OTT ANA 2 
and (S/N), is often called the carrier-to-noise ratio (CNR). The predetection BPF- 
is assumed to have a nearly ideal response with bandwidth B; centered at f.. 


Figure 10.3-1 portrays our model for the remaining portion of the receiver, with 
the detector input u(t) = x,(f) + n(t) = A(t) cos [wt + 6,(4)]. The limiter sup- 
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u(t) = x(t) + a(t) = AG) cos [et + OD] 


i | e 








Limiter Discriminator \ HAF) yp(t) 
(5) Ag? $y) PM 
R N, = s 
oBr yO L 3. EM 


Figure 10.3-1 Model for detection of exponential modulation plus noise. 


presses any amplitude variation represented by A,(¢). To find the signal and noise 
contained in $, (f), we express n(t) in envelope-and-phase form and write 


v(t) = A, cos [ot + p(t)] + A(t) cos [out + ¢,(t)] [2] 


The phasor construction of Fig. 10.3-2 then shows that 


e(t) = f(t) + tan! A sim [ir ó(] — [3] 
A, + A(t) cos [b,(t) — &(0] 

The first term of $, (f) is the signal phase by itself, but the contaminating second 

term involves both noise and signal. Clearly, this expression is very unclear and we 
can't go much further without some simplifications. 

A logical simplification comes from the large-signal condition (S/N), => 1, so 

A, > A,(r) most of the time and we can use the small-argument approximation for 

the inverse tangent function. A less obvious simplification ignores $(2) in Eq. (3), 

replacing $,(f) — $(f) with $,(f) alone. We justify this step for purposes of noise 

analysis by recalling that $, has a uniform distribution over 27 radians; hence, in 


the sense of ensemble averages, ¢, — d differs from ¢, only by a shift of the mean 
value. With these two simplifications Eq. (3) becomes 


bolt) = P(t) + v(t) [4] 


where 





A A, sin e.t) E l 
yt) = 5 = US n,(t) [5] 


in which we've substituted n, = A, sin $, and Sp = A22. 





Figure 10.3-2 Phasor diagram of exponential modulation plus noise. 


413 


414 


CHAPTER 10 @ Noise in Analog Modulation Systems 


Equation (4) says that the signal phase (t) and the equivalent phase noise w(t) 
are additive under the large-signal condition. Equation (5) brings out the fact that 
w(t) depends on the quadrature component of n(t) and decreases with increasing 
signal power. 

Now let (f) = 0 and consider the resulting noise y(t) at the output of a phase 
detector. The PM postdetection noise power spectrum has the shape of G, ( f) in 
Fig. 10.1-35 (p. 402), but multiplied by 1/2Sp because y? = n 2/28g. Hence, 


N, 
GF) ~ Ea n(£) [é] 


which is essentially flat over |f| = B,/2, as sketched in Fig. 10.3-3. 

Since B7/2 exceeds the message bandwidth W, save for the special case of 
NBPM, the receiver should include a postdetection filter with transfer function 
Hp(f) to remove out-of-band noise. If Hp(f) approximates the response of an ideal 


LPF with unit gain and bandwidth W, then the output noise power at the destination 
will be 


NW 





m= | "APA = 


PM [7] 


The shaded area in Fig. 10.3-3 equals Np. 


Next consider a frequency detector with input $,(t) = Y(t), so the output is the 
instantaneous frequency noise 


| jM 1 
g(t) 8 24) = a) [8 
O S On IVS, ` 
Thus, from Eq. (21), p. 370, we get the FM postdetection noise power spectrum 
Nof^ ( f 
GF) = Ont? cue Gulf) 7 7 of n(Z) 91 
Sp T 


This parabolic function sketched in Fig. 10.3—4 has components beyond W < B4/2, 
like PM, but increases as f?. 





Figure 10.3-3 PM postdetection noise spectrum. 
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Figure 10.3-4 


FM postdetection noise spectrum. 


If we again take the postdetection filter to be an essentially ideal LPF that 
passes the shaded region in Fig. 10.3-4, the destination noise power will be 





- i _ Nw? 
N = Gel f df = xm FM [10] 
w R 


However, if we also incorporate deemphasis filtering such that |H,(f)| = 
Fa IG/2W) with |Haf)| = [1 + Gf/ Bae)? then 


Np = [ mscr Ge(f df = E (25) — a (2-)| [114] 


de 
In the usual case where W/B,, > 1, Eq. (11a) simplifies to 
Np NgBAW/S&  Deemphasized FM 
since tan”! (W/B4,) = 77/2 << WIB,.. 
Let’s summarize and comment on our results up to this point as follows: 


[115] 


1. The postdetection noise spectral densities in PM and FM have out-of-band 
components that call for postdetection filtering. 

2. The PM noise spectrum is flat, like linear modulation except for the out-of-band 
components. 

3. 'The FM noise spectrum increases parabolically, so higher baseband signal fre- 
quencies suffer more noise contamination than lower frequencies. Deemphasis 
filtering compensates for this effect, provided that the message has been preem- 
phasized at the transmitter. 

4. 


The destination noise power Np in PM and FM decreases as Sp increases, a phe- 


nomenon known as noise quieting. You may hear noise quieting on an FM 
radio with the volume turned up when you tune between stations. 





\f| > W. Then estimate the total area Np assuming Br 22» W >> B,,. Compare your 
result with Eq. (115). 
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Exponential CW Modulation with Noise 


Find the deemphasized noise spectrum |Ha (f) Gf) without a lowpass cutoff for EXERCISE 10.3-1 
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Destination S/N 


Now we can calculate the signal-to-noise ratios for PM, FM, and deemphasized FM. 
We continue the large-signal condition (S/N)p => 1 so our previous results for Np 
still hold. The presence of a signal phase (2) does not void those results even 
though we replaced $,(t) — (£) with $, (2) in Eq. (3). Had we included ¢(t) in the 
phase noise, a more complicated analysis would have revealed that the postdetection 
noise spectrum includes additional components that fall outside the message band 
and are rejected by Hp( f). 


The demodulated signal plus noise in a PM system with $(t) = ¢,x(d) is 
y(t) = &(t) = éax(t) + vr) 


The postdetection filter passes the signal term $,x(t) so Sp = px? = = fS,, and the 
output noise power Np is given by Eq. (7). Hence, 


S PAS, n 
NR E ; 
(5), (NaW/S,) S D Pam ARM us 


Since y equals the output S/N for analog baseband transmission (or suppressed- 
carrier linear modulation) with received power Sz, bandwidth W, and noise density 
No, we see that PM gives an improvement over baseband of exactly $25,. But in 
view of the alee constraint ġa = 7, the PM improvement is no greater than 
PRS dma = = q?, or about 10 dB at best. In fact if $2.5, < 1, then PM performance is 
inferior to baseband but the transmission bandwidth is still By = 2W. 
The demodulated signal plus noise in an FM system with b(t) = 2mfax(t) is 


v(t) = z-à0 = fat) + £0 


The postdetection filter passes the signal term fax(t) so Sp = ff S,, and Np is given by 


Eq. (10). Hence, 
3), aan E 
N/p  (NoW?/3Sp) W) *NyW 


in which we spot the deviation ratio D = f,/W. We therefore write 


S 
—|23Dp? FM 13] 
(5), d 


and it now appears that (S/N)p can be made arbitrarily large by increasing D with- 
out increasing the signal power Sg—a conclusion that requires further qualification 
and will be reexamined shortly. 

Meanwhile, recall that the transmission bandwidth requirement By increases 


with the deviation ratio. Therefore, Eq. (13) represents wideband noise reduction 
in-that we have the latitude to 
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ping (S/N]3 fixed. 





To emphasize this property, take the case of wideband FM with D >> 1 and B, = 
2f, 2» W. Then D = B,/2W and Eq. (13) becomes 


(=) =2(22)'s WBEM [14] 
N], ANW 


which shows that (S/N), increases as the square of the bandwidth ratio BYW. With 
smaller deviation ratios, the break-even point compared to baseband transmission 
occurs when 3D?S, = 1 or D = 1/ V3S, = 0.6. The dividing line between NBFM 
and WBFM is sometimes designated to be D = 0.6 for this reason. 

Finally, if the receiver includes deemphasis filtering and By, << W, the output 
noise is further reduced in accordance with Eq. (115). Thus, 


S fu M 
—} =|——]} Qy Deemphasized FM [15] 
N D B de 


` and we have a deemphasis improvement factor of about (W/B4,)?/3. This improve- 

ment requires preemphasis filtering at the transmitter and may carry a hidden 
penalty. For if the message amplitude spectrum does not roll off at least as fast as 1/f, 
like an audio signal does, then preemphasis increases the deviation ratio and the 
transmission bandwidth requirement. 
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the broadcast FM parameters fa = 75 kHz, W = 15 kHz, and D = 5. Taking S, = 1/2 
for a representative value, Eq. (13) gives 


(S/N)p = (3 X 5? x 1/2)y = 38y 


or about 16 dB better than analog baseband transmission. Deemphasis filtering with 
Bae = 2.1 KHz increases (S/N), to about 640y. Thus, other factors being equal, a 1-W 
FM system with deemphasis could replace a 640-W baseband system. The cost of 
this transmitted power reduction is increased bandwidth, since FM with D = 5 
requires By = 14W. 

But several practical factors work against full realization of increased bandwidth in 
exchange for reduced transmitter power S7. And indeed the goal of FM broadcasting 
is maximum (S/N) rather than minimum S;. However, other applications involve 
minimizing Sr or squeezing as much as possible from every available transmitted 
watt. The large-signal condition (S/N), => 1 then poses a serious limitation for such 
applications, and the FM threshold effect becomes a matter of grave concern. 


EXAMPLE 10.3-1 
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EXERCISE 10.3-2 Calculate the minimum transmitted power needed when a PM system replaces the 
1-W FM system in Example 10.3-1 and the value of (S/N)p remains the same. 





FM Threshold Effect 


The small-signal condition (S/N), < 1 can be represented by a phasor diagram like 
Fig. 10.3-2 with signal and noise phasors interchanged. Then, since A,(f) => A, 
most of the time, the resultant phase at the detector input is 


A. sin [ó(r) — $n] 
bolt) = palt) + Af) [16] 
The noise now dominates and the message, contained in $(f), has been mutilated 
beyond all hope of recovery. 

Actually, significant mutilation begins to occur when (S/N); ~ 1 and A, ~ A. 
With phasors of nearly equal length, we have a situation similar to cochannel inter- 
ference when p = A/A. ~ 1. Small noise variations may then produce large spikes 
in the demodulated FM output. The phasor diagram in Fig. 10.3—5a illustrates this 
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Figure 10.3-5 FM near threshold. (a) Phasor diagram; (b] instantaneous phase and frequency. 
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point taking d(t) = 0 and ¢,(t,) = —7 so $, (tj) = —r. If the variations of A,(t) and 
$,(t) follow the dashed locus from 1, to t; then $,(t;) ^ +77. Correspondingly, the 
waveform @,(t) in Fig. 10.3-5b has a step of height 27 and the output 
y(t) = ¢,(t)/2m has a unit-area spike. These spikes would be heard on an FM radio 
as a crackling or clicking sound that masks the signal. 

We infer from this qualitative picture that the output noise spectrum is no longer 
parabolic but tends to fill in at dc, the output spikes producing appreciable low- 
frequency content. This conclusion has been verified through detailed analysis using 
the “click” approach as refined by Rice. The analysis is complicated (and placed 
beyond our scope) by the fact that the spike characteristics change when the carrier 
is modulated—called the modulation-suppression effect. Thus, quantitative results 
are obtained only for specific modulating signals. In the case of tone modulation, the 
total output noise becomes 


NW? 12D 
Np = ——— |1 


so ee 2 
where the second term is the contribution of the spikes. See Rice (1948) and 
Stumpers (1948) for the original work. 

Figure 10.3—6 shows (S/N), in decibels plotted versus y in decibels for two val- 
ues of the deviation ratio D, taking tone modulation and Np given by Eq. (17). The 
rather sudden drop-off of these curves is the FM threshold effect, traced to the expo- 
nential factor in Eq. (17). We see that 



































y, dB 


Figure 10.3-6 FM noise performance [without deemphasis]. 
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Below threshold, noise captures the output just like strong cochannel interference. 
Experimental studies indicate that noise mutilation is negligible in most cases of 
interest if (S/N)z = 10 or thereabouts. Hence, we define the threshold point to be at 
(S/N an = 10 [18] 
Equivalently, since (S/N)g = (W/Bzyy, 


Br 
Ya = 10-7 = 20M(D) [19a] 


=20(D+2) D>2 [195] 


where use has been made of the FM bandwidth equation By = 2M(D)W = 
2(D + 2)W. Equations (18) and (19) also apply to PM with D replaced by dy. 

Figure 10.3-6 correctly demonstrates that FM performance above threshold is 
quite impressive—after all, baseband transmission at best gives (S/N)p = y. And 
these curves do not include the additional improvement afforded by deemphasis fil- 
tering. But observe what happens if we attempt to make (S/N) arbitrarily large by 
increasing only the deviation ratio while holding y fixed, say at 20 dB. With D = 2 
(Br ^» 7W) we are just above threshold and (S/N) ~ 28 dB. But with D = 5 (Br = 
14W) we are below threshold, and the output signal is useless because of mutilation. 
We therefore cannot achieve an unlimited exchange of bandwidth for signal-to-noise 
ratio, and system performance may actually deteriorate with increased deviation. 

Swapping bandwidth in favor of reduced signal power is likewise restricted. 
Suppose, for example, that a 30-dB signal-to-noise ratio is desired with a minimum 
of transmitted power but the transmission bandwidth can be as large as By = 14W. 
Were it not for threshold effect, we could use FM with D = 5 and y = 14 dB, a 
power saving of 16 dB compared to baseband. But the threshold point for D = 5 is 
at Ym = 22 dB, for which (S/N)p = 37 dB. Thus, 





Correspondingly, the potential power reduction may not be fully realized. 
In view of these considerations, it’s useful to calculate (S/N); at the threshold 
point. Thus, again omitting deemphasis, we substitute Eq. (19) into Eq. (13) to get 
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S 
A — 3pD?s [20 ] 
| (8). > xYth a 
a ~60D(D+2)S D>2 [205] 
»" which equals the minimum value of (S/N)p as a function of D. Given a specified 
| value for (S/N)p and no bandwidth constraint, you can solve Eq. (20) for the devia- 


tion ratio D that yields the most efficient performance in terms of signal power. Of 
course, some allowance must be made for possible signal fading since it is unadvis- 
able to operate with no margin relative to the threshold point. 


E 
‘at 








a Suppose a minimum-power FM system is to be designed such that (S/N), ~ 50 dB, EXAMPLE 10.3-2 
given 5, = 1/2, W = 10 KHz, No = 1078 W/Hz, and no constraint on By. Temporarily 
il ignoring threshold, we might use Eq. (13) to get 10° = 1.5D*y so y = 296 when 


D = 15. But taking threshold into account with the stated values and the assumption 
that D > 2, Eq. (20) becomes 10° = 60D7(D + 2)/2, and trial-and-error solution 
yields D = 15 so B, = 2(D + 2)W = 340 kHz. Then, from Eq. (19a), Sy/NW = 
Yn = 10 X 34 = 340, which requires Sp = 340N,)W = 34 mW. 
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Find the minimum useful value of (S/N); for a deemphasized FM system with By = 
SW, fa = 10B,,, and S, = 1/2. 


EXERCISE 10.3-3 
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Threshold Extension by FM Feedback* 


Since the threshold limitation yields a constraint on the design of minimum-power 
analog FM systems, there has been interest in threshold extension techniques. Long 
ago Chaffee (1939) proposed a means for extending the FM threshold point using a 
frequency-following or frequency-compressive feedback loop in the receiver, called 
an FM feedback (FMEB) receiver. 

The FMFB receiver diagrammed in Fig. 10.3—7 embodies features of a phase- 
lock loop within the superheterodyne structure. Specifically, the superhet’s LO has 
been replaced by a VCO whose free-running frequency equals f, — fj. The control 
voltage for the VCO comes from the demodulated output yp(). If the loop has suffi- 
cient gain K and (S/N )p is reasonably large, then the VCO tracks the instantaneous 
phase of x,(/). This tracking action reduces the frequency deviation from fa to 
fal + Kj, as well as translating the signal down to the IF band. Thus, if K is such 


that f4/(1 + K)W < 1, then the IF input becomes a narrowband FM signal and the IF 
bandwidth need be no larger than Byp ~ 2W. 





VCO tracking likewise reduces the noise frequency deviation by the same factor, 
so (S/N)p equals that of a conventional receiver when (S/N); >> 1. But note that the 
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Figure 10.3-7 FMFB receiver for threshold extension. 


IF has a larger predetection signal-to-noise ratio, namely (S/N); = Sg/Nogip ~ 
(By/2W)(S/N )p. Since threshold effects now depend primarily on the value of (S/N );p, 
the threshold level has been extended down to a lower value. Experimental studies 
confirm a threshold extension of 5-7 dB for FMFB receivers—a signficant factor for 
minimum-power designs. A conventional receiver with a PLL demodulator also pro- 
vides threshold extension and has the advantage of simpler implementation. 


10.4 COMPARISON OF CW MODULATION SYSTEMS 


At last we're in a position to make a meaningful comparison of the various types of 
analog CW modulation. Table 10.4-1 summarizes the points to be compared: nor- 
malized transmission bandwidth b = B;/W, destination signal-to-noise ratio (S/N) p 
normalized by y, threshold point if any, dc response (or low-frequency response), and 
instrumentation complexity. The table also includes baseband transmission for refer- 
ence purposes. As before, we have used y = S/N W where Sp is the received signal 
power, Wis the message bandwidth, and No = kUy is the noise density referred to the 
receiver input. We have also used S, = x? = <x?(r)>, where x(f) is the message. 
Nearly ideal systems are assumed, so the values of (S/N) p are upper bounds. 

Of the several types of linear modulation, suppressed carrier methods are supe- 
rior to conventional AM on at least two counts: signal-to-noise ratios are better, and 
there is no threshold effect. When bandwidth conservation is important, single 
sideband and vestigial sideband are particularly attractive. But you seldom get 
something for nothing in this world, and the price of efficient linear modulation is 
the increased complexity of instrumentation, especially at the receiver. Synchronous 
detection, no matter how it’s accomplished, requires sophisticated circuitry com- 
pared to the envelope detector. For point-to-point communication (one transmitter, 
one receiver) the price might be worthwhile. But for broadcast communication 
(one transmitter, many receivers) economic considerations tip the balance toward 
the simplest possible receiver, and hence envelope detection. 

From an instrumentation viewpoint AM is the least complex linear modulation, 
while suppressed-carrier VSB, with its special sideband filter and synchronization 
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10.4 Comparison of CW Modulation Systems 


requirements, is the most complex. Of DSB and SSB (in their proper applications) 
the latter is less difficult to instrument because synchronization is not so critical. In 
addition, improved filter technology has made the required sideband filters more 
readily available. Similarly, VSB + C is classed as of "moderate" complexity, 
despite the vestigial filter, since envelope detection is sufficient. 


Table 10.4-1 Comparison of CW modulation systems 














Type b=BAW  (SIN)n + y Yih DC Complexity Comments 
Baseband 1 1 M No! Minor No modulation 
2 
us, ; 
AM 2 a =a 20 No Minor Envelope detection 
1+ uS 
nx 
DSB 2 1 Yes Major Synchronous detection 
SSB 1 1 No Moderate Synchronous detection 
VSB 1+ 1 Yes Major Synchronous detection 
2 
HS, 
VSB + C 1+ EYE 20 Yes? Moderate Envelope detection 
1+ ws, 
<1 
PM? 2M($4) iS, 10b Yes Moderate Phase detection, 
constant amplitude 
$a xa 
FM?4 2M(D) 3D°S, 10b Yes Moderate Frequency detection, 


constant amplitude 


‘Unless direct coupled. 


2With electronic dc restoration. 
3b = 2. 
^Deemphasis not included. 


Compared to baseband or linear modulation, exponential modulation can pro- 
vide substantially increased values of (S/N)p—especially FM with deemphasis— 
with only moderately complex instrumentation. Figure 10.4—1 illustrates this in a 
form similar to Fig. 10.3~6 (again taking S, = 1/2) except that a representative 12-dB 
deemphasis improvement has been added to the FM curves and performance below 
threshold is omitted. All curves are labeled with the bandwidth ratio b. 

Clearly, for equal values of b, FM is markedly superior to PM insofar as noise 
performance is concerned. And as long as the system is above threshold, the improve- 
ment can be made arbitrarily large by increasing b, whereas PM is limited to b = 10 
since ġa = 7. The penalty for the FM improvement is excessive transmission band- 
width. Therefore, wideband exponential modulation is most appropriate when 
extremely clean output signals are desired and bandwidth conservation is a secondary 
factor. At microwave frequencies, both the noise-reduction and constant-amplitude 
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Figure 10.4-1 Performance of CW modulation systems, including 12-d8 deemphasis improve- 
ment for FM. 


properties are advantageous. As to power conservation, FM with moderate values of 
b does offer a saving over linear modulation, threshold limitations notwithstanding. 

Regarding transmission of modulating signals having significant low-frequency 
components, we have already argued the superiority of DSB and VSB. For facsimile 
and video, electronic dc restoration makes envelope-detected VSB possible and 
desirable. (AM could be used in this way, but the bandwidth is prohibitive. 
Suppressed-carrier single sideband is virtually out of the question.) Also we noted 
previously that a balanced discriminator has excellent low-frequency response; 
hence, the low-frequency performance of FM can equal that of DSB or VSB, and 
without troublesome synchronization. For similar reasons, high-quality magnetic- 
tape recorders are often equipped with an FM mode in which the input is recorded as 
a frequency-modulated wave. 

Not shown in the table is relative system performance in the face of time- 
varying transmission characteristics, frequency-selective fading, multiple-path 
propagation, and so forth. An unstable transmission medium has a multiplicative 
effect, which is particularly disastrous for envelope detection. (Late-night listeners 
to distant AM stations are familiar with the garbled result.) Similarly, transmission 
instabilities often preclude wideband modulation. 

To summarize briefly is impossible. There is no universal solution to all com- 
munication problems. The communication engineer must therefore approach each 
new task with an open mind and a careful review of all available information. 
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10.5 Phase-Lock Loop Noise Performance 





10.5 PHASE-LOCK LOOP NOISE PERFORMANCE* 

In Chapter 7, we assumed the PLL input was noise free. This is usually not the case, 

so we now want to consider the effects of additive white noise on PLL performance. 
Let's augment the linearized models of Figs. 7.3—8b and 7.3-8c with a noise 

source as shown in Figure 10.5-1. The output of the second summer is the phase 

error e(t). Since the system is linear, we can also consider the signal and noise inputs 


separately, and so with just the noise input, the power spectral density (PSD) of the 
VCO output phase is 


Galf) = IH GO? GY) (1] 


where G,(f) is the PSD of the noise and H,(f) is the PLL closed loop transfer func- 
tion previously derived in Sect. 7.3. If the noise source is white, then G,(f) = No/2 
and the VCO output variance (or noise) becomes 


o, = No | ILA)? df vi 
0 


Since we have assumed the input $(f) = O then the phase error noise becomes 
o = o3. Given thatthe noise equivalent bandwidth is 


By = if IRLA) df ii 
O49 


and assuming that g = 1, the phase error noise becomes 
a? = No By [4] 


This phase noise represents the amount of variation, or jitter, contained in the VCO 
output and is called phase-jitter. 


When Eqs. (3) and (4) are examined closely, we see that phase jitter decreases 
with narrower loop bandwidths. Therefore loop bandwidth versus phase jitter 
becomes another trade-off we have to deal with. Decreasing the loop bandwidth will 


GaC f) =No/2 


D(F) | 
Hf) —- y(t) 
p(t) e(t) 








Figure 10.5-1 Linearized PLL model with additive noise 
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decrease phase jitter, but adversely affects the ability of the system to track the input 
signal's phase variations. 


If we assume the input signal is sinuosidal with amplitude A,, then we can 
express the loop's signal-to-noise ratio by 


A? 
pee (5 


Oe 


Equations (4) and (5) assume a linear PLL model with a relatively high signal- 
to-noise ratio. If this is not the case, a better approximation for phase-jitter for a 
first-order PLL is given by Vitterbi (1966) as 


ua | , exp(y cos ¢) 
! = 2arJo(y)) 


where J9(»y) is the zeroth-order modified Bessel function of the first kind. 


do [6] 





10.6 ANALOG PULSE MODULATION WITH NOISE 


Finally, we take a brief look at analog pulse modulation with noise. For simplicity, 
we'll assume that the pulse-modulated signal has been sent at baseband, so there 
will be no CW modulation and no bandpass noise. Instead, the predetection noise 
will have a lowpass spectrum. 


Signal-to-Noise Ratios 


Regardless of the particular circuitry employed in the receiver, demodulating an 
analog pulse-modulated wave boils down to message reconstruction from sample 
values. A generalized demodulator would therefore consist of a pulse converter that 
transforms the pulse-modulated wave into a train of weighted impulses from which 


an ideal LPF reconstructs the message. Figure 10.6—1 diagrams this demodulation 
model including additive noise. 


u(t) = X p(t) + n(t) 























Noise | Reconstruction 
xp(t) Pulse ys) yp) 
Exc xr js converter [3 Pu 
N = f/2 


I 


t 
G= 7 ] 


Synchronization 


Figure 10.6-1 Model for demodulation of analog pulse modulation with noise. 
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10.6 Analog Pulse Modulation with Noise 


A noise-limiting filter at the input passes the received pulsed signal x,(t) plus 
lowpass noise n(7), so v(t) = x,(t) + n(¢). The converter measures the amplitude, 
duration, or position of each pulse in v(t) and generates 


xt) = 2 T + e]&(r — kr) ul 
k 
where u, is the modulation constant relating x(kT;) to x, (f) and e, is the measure- 
ment error induced by the noise. We'll assume for convenience that the reconstruc- 
tion filter has bandwidth B = f/2, gain K = T, = 1/2B, and zero time delay. Hence, 
the impulse-train input ys(f) produces the final output 


volt) Eam) ex] sinc (Je = E)-uaux(Ü) + nol) al 


with 
np(t) = J, e,sinc(f,t — k) [2b] 
k 


which represents the noise at the destination. 

Since the errors e, are proportional to sample values of the lowpass noise n(t) 
spaced by T,, and since the noise-limiting filter has By > 1/T,, the values of e, will 
be essentially uncorrelated and will have zero mean. We can therefore write the des- 
tination noise power as Np = nj, = e; = o? and the signal power is Sp —u2x^ = 


hos c Hence, 
S Sos BE 
N D Np o~ 


which expresses the destination signal-to-noise ratio in terms of the error variance 
o° caused by reconstruction from noisy samples. Our next task is to determine u3 
and o? for specific types of analog pulse modulation. 

A PAM signal contains the message samples in the modulated pulse amplitude 
Ao[1 + px(kT,)], so the modulation constant is Mp = Aou = Ap. From Sect. 9.5, the 
amplitude error variance is o? = o2 = N,By. Thus, under the best conditions of 
maximum modulation (u = 1) and minimum noise bandwidth (By = 1/27), we have 


( S ) 2A?r 
ande = S, 
N/p M 
where 7 is the pulse duration. 


When u = 1 and x(t) has no dc component, the average energy per modulated 
pulse is AS[1 + x(kT,)]?r = AS(1 + S,)r. Multiplying this average energy by the 
pulse rate f, gives the received signal power Sp = f,A%(1 + Sr. We thus obtain our 
final result in the form 
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(=) DEM A (=) : i 
NX NW wes uu SS i 
This result shows that (S/N )p = y/2, so that PAM performance is at least 3 dB below 
unmodulated baseband transmission—just like AM CW modulation. The maximum 
value is seldom achieved in practice, nor is it sought after. The merit of PAM resides 
in its simplicity for multiplexing, not in its noise performance. 

However, PPM and PDM do offer some improvement by virtue of wideband 
noise reduction. For if By ~ Bry, the time-position error variance is o? = o} = 
N,/(4B7A?). Since the pulse amplitude A is a constant, the received power can be 


written as Sp = f, A?ro, where Tg denotes the average pulse duration in PDM or the 
fixed pulse duration in PPM. Equation (3) then becomes 


(s) QUEM = 4&2 B Sa S [5] 
N/b No EU OPE T N fto 7 





Ww 
= 4p s (7) S Y PDM or PPM 
s'0 


This expression reveals that (S/N)p increases with increasing transmission band- 
width Bj. The underlying physical reason should be evident from Fig. 9.5-2, 
p. 388, with t, = 1/2B,. 

The PPM modulation constant is the maximum pulse displacement, so up = fo. 
The parameters £j and 7, are constrained by 


to 1/2 To = T = 2t, = 1B, 


and f, = 1/T, = 2W. Taking all values to be optimum with respect to noise reduction, 
we obtain the upper bound 


S 1 / BY 
—} =-|—] sS PP 6 
e C) pe SM x 


Hence, PPM performance improves as the square of the bandwidth ratio BYW. A 
similar optimization for PDM with ju, = 179 = Tọ yields the less-impressive result 


S 1 Br i 
—|;=-— PDM 7 
(5), QW 5 


To approach the upper bound, a PDM wave must have a 50 percent duty cycle so 
that To = T2. 

Practical PPM and PDM systems may fall short of the maximum values pre- 
dicted here by 10 dB or more. Consequently, the noise reduction does not measure 
up to that of wideband FM. But remember that the average power Sp comes from 
short-duration high-power pulses rather than being continuously delivered as in CW 


modulation. Power-supply considerations may therefore favor pulsed operation in 
some circumstances. 
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Explain why a single-channel PDM system must have uro = 1/4W Then derive Eq. EXERCISE 10.6-1 
(7) from Eq. (5) with up = MT. 





False-Pulse Threshold Effect 


Suppose you try to increase the value of (S/N)p in a PDM or PPM system by making 
By very large. Since n? increases with By, the noise variations in u(t) = xy) + n(t) 
will eventually dominate and be mistaken for signal pulses. If these false pulses 
occur often, the reconstructed waveform has no relation to x(t) and the message will 
have been completely lost. Hence, pulse-time modulation involves a false-pulse 
threshold effect, analogous to the threshold effect in wideband FM. This effect does 
not exist in PAM with synchronization because we always know when to measure 
the amplitude. 

To determine the threshold level, we’ll say that false pulses are sufficiently 
infrequent if P(n = A) < 0.01. For gaussian noise with o2, = n? = NoBr, the corre- 
sponding threshold condition is approximately A = 20 y, so the pulse must be strong 
enough to “lift” the noise by at least twice its rms value. (This is the same condition 
as the tangential sensitivity in pulsed radar systems.) Using the fact that A? = 
Spl Tof, We have Sp/tof, = 4No Br or 


= ($5) =a Stes is 
Yu NW a 0s y T 


This threshold level is appreciably less than that of FM, so PPM could be advanta- 
geous for those situations where FM would be below its threshold point. 











10.7 PROBLEMS 


White noise with Ny — 10 is applied to a BPF having BKA plotted in Fig. 
P10.1-1. Sketch GAS) taking f. —f, and show therefrom that n? = pn. 


10.1-1 





f 
-h -f 0 fi-> f h f*2b 
Figure P10.1-1 


10.1-2 Do Prob. 10.1-1 taking f. = f. 


10.1-3* White noise is applied to a tuned circuit whose transfer function is 


eoho 
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10.1-10* 


10.1-11 ` 


10.1-12 


10.1-13 
10.1-14 
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Evaluate G,(f) > No at f/f, = 0, £0.5, £1, £1.5, and £ 2. Then plot G, ( f ). 


The BPF in Fig. 10.1-2 (p. 400) will have local symmetry around +f. if its lowpass 
equivalent function H,,(f) = = Hal ft A )u(f + f.) has the even-symmetry property 
Ha -A| = Hef). (a) Let G,(f) = (No/2)|He,( f )| and show that we can write 


Gof =f) f>0 


Gf) = Golf- 1) + Gu F +5) = G (f +f) f<0 


(b) Use this result to show that G, (f) = 2Ga(f). 


A tuned circuit with Q = f/B, >> 1 approximates the local-symmetry property 
in Prob. 10.1-4, and has Af) = VU + j2(f — f£) Br] for f > 0. (a) Find Gz, (f). 
(b) Evaluate n? by calculating n?. 

Let y(t) = 2n(t) cos (w,t + 0), where n(t) is bandpass noise centered at f.. Show that 
y(t) consists of a lowpass component and a bandpass component. Find the mean 
value and variance of each component in terms of the properties of n(t). 


Bandpass gaussian noise with oy = 2 is applied to an ideal envelope detector, 
including a dc block. Find the PDF of the output y(7) and calculate oy. 


Bandpass gaussian noise with variance of is applied to an ideal square-law device, 
producing y(A = A2(t). Find y, y?, and the PDF of y(t). 
Let velt) = 3[v(t) + jo(t) Je 7". Assuming that v(z) is Fourier transformable, 


show that V (f) = Flv aA] = Vf + fou(f + f). Then sketch V,,(f) taking V(f) as 
in Fig. P10.1-1 with f. = f| + b. 


Let H&(f) in Fig. 10.1-2 (p. 400) be an ideal BPF with unit gain and let a = 1/2 so 
f. is the center frequency. Find R,(r) and R,(7) by taking the inverse transform of 
G,(f) and (~j sgn f)G,(f). Then obtain R, (7) from Eq. (17) (p. 405) and confirm 
that it equals the inverse transform of G, ( f). 

Do Prob. 10.1—10 with a = 0, so f, is the lower cutoff frequency. 


Suppose G,,(f) has local symmetry around +f,, as detailed in Prob. 10.1-4. Write the 
inverse transforms of G,( f) and G,(f) to show that R,(7) =R,,(7) cos or. Then 
show from Eq. (17) that R,() = R, (7) sin oT. 

Derive Eq. (17) starting from E[n,(t)n,(t — 7)]. 

Derive Eq. (18) starting from Efn, (Hnt — T). 

Let G,(f) have local symmetry around +f,. Prove that R,, (7) = O for all 7 using: 


(a) the correlation relations in Prob. 10.1—12; (b) the spectral relations in Prob. 
10.124. 


Let H&( f) in Fig. 10.1-2 (p. 400) be an ideal BPF with unit gain and let œ = 0, sof, 
is the lower cutoff frequency. Use Eq. (19) to find R,, (7). 
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10.2-1* 


10.22 
10.2-3 
10.2-4 


10.2-5* 


10.2-6 
10.2-7 


10.2-8 


10.2-91 


10.2-10 


10.2-11* 


10.2-12 
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Do Prob. 10.1-16 with œ = 1/4, so the lower cutoff frequency is f. — Bj/4. 


A DSB signal plus noise is demodulated by synchronous detection. Find (S/N)p in 
dB given that Sp = 20 nW, W = 5 MHz, and Jy = 109 ,. 


An AM signal plus noise is demodulated by synchronous detection. Find (S/N), in 
dB given that S, = 0.4, Sp = 20 nW, W = 5 MHz, p = 1, and Jy = 109}. 


A DSB signal plus noise is demodulated by a product detector with phase error $'. 
Take the local oscillator signal to be 2 cos (w,t + p’) and show that (S/N)p = Y cos? $'. 


Rewrite Eqs. (4b) and (5) (p. 408) in terms of y, = S,/NoW, where 5, is the peak 
envelope power of the DSB or AM signal. 


Let x,(t) have quadrature multiplexing as on p. 271, where x(t) and x;(t) are inde- 
pendent signals and x? — x2. Assume an ideal receiver with AWGN and two syn- 
chronous detectors. Find the output signal plus noise for each channel, and express 
(S/N)p in terms of y. 


Explain why an SSB receiver should have a nearly rectangular BPF with bandwidth 
Br — W, whereas predetection filtering is not critical for DSB. 


Modify Eq. (8) (p. 408) for LSSB when |H,(f)|* has the shape in Fig. P10.1-1 with 
f,- f and2b = W. 


Some receivers have additive ‘‘one-over-f” noise with power spectral density G( f) = 
Nof/2\f| for f > 0. Obtain the resulting expressions for (S/N)p in terms of y and W/f. 
for USSB and DSB modulation. Compare your results when W/f. = 1/5 and 1/50. 


When a demodulated signal includes multiplicative noise or related effects, the 
postdetection S/N cannot be defined unambiguously. An alternative performance 
measure is then the normalized mean-square error €? = E([x(r) — Kyp(t)?}/S, 
where K is chosen such that Kyp(t) = x(r) in absence of multiplicative effects. Find 
yp(t) and show that €? = 2[1 — cos 6 ] + 1/y when a USSB signal with AWGN is 
demodulated by a product detector whose local-oscillator signal is 2 cos [c.t + $(1)], 
where $(f) is a slowly drifting random phase. Hint: 2? = x? and xx = —R,(0) 
= 0 since R,(r) is an odd function. 





Explain why an AM receiver should have a nearly rectangular BPF with bandwidth 


Br = 2W for envelope detection, whereas predetection filtering is not critical for 
synchronous detection. 


An AM system with envelope detection is operating at the threshold point. Find the 
power gain in dB needed at the transmitter to get up to (S/N), = 40 dB with full- 
load tone modulation. 


An AM system with envelope detection has (S/N)p = 30 dB under full-load tone- 
modulation conditions with W = 8 kHz. If all bandwidths are increased accordingly, 
while other parameters are held fixed, what is the largest useable value of W? 
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Consider an AM system with envelope detection operating below threshold. Find e? 
defined in Prob. 10.2-9 assuming that yp(£) = y(2, x? = 1, and x = 0. Express your 
answer in terms of y. 


An exponentially modulated signal plus noise has Sg = 10 nW, W = 500 kHz, and 
Ty = 102 ,. Find the value of Np for PM detection, FM detection, and deempha- 
sized FM detection with By. = 5 kHz. 


Suppose an nth-order Butterworth LPF is used for the postdetection filter in an FM 
receiver. Obtain an upper bound on Np and simplify for n 22» 1. 


Find G,(f) when the predetection BPF in an FM receiver has H,(f) as given in Prob. 
10.1—5. Then calculate Np and simplify taking B7 => W. 


An FM signal plus noise has Sp = 1 nW, W = 500 kHz, S, = 0.1, fa = 2 MHz, and 


J'y = 102,. Find (S/N)p in dB for FM detection and for deemphasized FM detec- 
tion with B4, = 5 kHz. 


Obtain an expression for (S/N)p for PM with deemphasis filtering (p. 416). Simplify 
your result taking By. << W. 


The signal x(t) = cos 277200t is sent via FM without preemphasis. Calculate (S/N) 


when fa = 1 kHz, 54 = 500N,, and the postdetection filter is an ideal BPF passing 
100 = |f| = 300 Hz. 


Obtain an expression for (S/N)p for FM with a gaussian deemphasis filter having 


\Hae(f)|? = e^ 4/5), Calculate the resulting deemphasis improvement factor when 
Bee = WIT. 


A certain PM system has (S/N)p = 30 dB. Find the new value of (S/N), when the 
modulation is changed to preemphasized FM with B4, = W/10, while By and all 
other parameters are held fixed. 


Modify Eq. (20) (p. 421) to include deemphasis filtering and rework the calculations 
in Example 10.3-2. 


Obtain an expression like Eq. (20) (p. 421) for PM, and determine the upper bound 
on (S/N )p at threshold. 


Consider the FMFB receiver in Fig. 10.3-7 (p. 422). Let v(t) be a noise-free FM sig- 
nal with deviation ratio D and take the VCO output to be 2 cos [(w, — wt + 


Kóp(0] where p(t) = 2«yp(t). Show that the deviation ratio of the IF signal is 


An analog communication system has x= 1/2, W = 10 kHz, N, = 107? W/Hz, 
and transmission loss L = 100 dB. Calculate $7 needed to get (S/N)p = 40 dB when 
the modulation is: (a) SSB; (b) AM with u = 1 and u = 0.5; (c) PM with $4 = 7; 


(d) FM with D = 1, 5, and 10. Omit deemphasis in the FM case, but check for 
threshold limitations. 


Do Prob. 10.4-1 with x? = 1 and W = 20 kHz. 
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An analog communication system has x? = 1/2, W = 10 KHz, S+ = 10 W, and Ng = 
107? W/Hz. Calculate the path length corresponding to (S/N)p = 40 dB for a trans- 
mission cable with loss factor œ = 1 dB/km when the modulation is: (a) SSB; 
(b) AM with u = 1; (c) FM with D = 2 and 8. 


Do Prob. 10.4—3 for line-of-sight radio transmission at f. = 300 MHz with antenna 
gains of 26 dB at transmitter and receiver. 


A signal with x? = 1/2 is transmitted via AM with u = 1 and (S/N), = 13 dB. If 
the modulation is changed to PM (without deemphasis) and the bandwidths are 
increased accordingly while other parameters remain fixed, what’s the largest usable 
deviation ratio and the resulting value of (S/N)p? 


A frequency-division multiplexing system has USSB subcarrier modulation and FM 
carrier modulation without preemphasis. There are K independent input signals, 
each having bandwidth Wo, and the subcarrier frequencies are f, = (k — DWyk— 
1,2,..., K. The baseband signal at the FM modulator is x,(t) = X a, x,(), where the 
a, are constants and x,(f) is the kth subcarrier signal, each having x? = 1. The 
discriminator at the receiver is followed by a bank of BPFs and synchronous 
detectors. Use the property that the predetection and postdetection S/N are equal for 


SSB to show that the output of the kth channel has (S/N), = fxa?/N, where N, = 
(3k? — 3k + 1)NoWp7/3Sp. 


In the system described in Prob. 10.4—6, the a, are chosen such that x2 = ] and all 


channels have the same output S/N. Obtain expressions for a, and (S/N), under these 
conditions. 


Consider an FM stereo multiplexing system (p. 270) where, for purposes of analy- 
sis, the preemphasis filtering may be done after matrixing and the deemphasis filter- 
ing before matrixing. Let the input signals before preemphasis be x,(f) = x,(¢) + x«() 
and x,(t) = xj (t) — xg(f), and let the deemphasized outputs before matrixing be y,(t) = 
faxi(t) + a(t and y(t) = faxt) + n (t) so the final outputs are y,(f) * y8. 
(a) Show that the power spectral density of n,(t) is |H(P)| (Ny Sg) (f? + £2)II(/2W) 
where fy = 38 kHz, and that n2 >> n? when W = 15 kHz and By, = 2.1 kHz. (b) Tak- 
ing xpxg = 0 and x? = x = 1/3—so that x} ~ 1 if the pilot is small—show that 
(S/N)p for each channel in stereo transmission is about 20 dB less than (S/N), for 
monaural transmission. 


A single-channel PPM signal plus noise has S, = 0.4, S, = 10 nW, W = 500 kHz, 
f, = 1.2 MHz, to 79 = 0.1T,, Br = 10 MHz, and Fy = 109 . Find the value of 
(S/N)p in decibels. 

A single-channel PDM signal plus noise has $, = 0.1, W = 100 Hz, f, = 250 Hz, 


u = 0.2, 79 = T/50, By = 3 kHz, and Jy = 509,. Find the value of Sp so that 
(S/N)p = 40 GB. 


Calculate the upper bound on (S/N)p for PPM baseband transmission with By = 
20W and compare with the actual value when 1j = 0.3T,, 7 = 0.2T,, and f, = 2.5W. 
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Equation (5), p. 428, also holds for each output of a TDM system if we write A? in 
terms of average power in the multiplexed waveform. Use this method to find (S/N) p 
for each channel of a PPM-TDM system having nine voice signals plus a marker, 
S,=1,W=3.2 kHz, f, = 8 kHz, 7 = T/50, B; = 400 kHz, and the marker has dura- 
tion 3r. Express your result in terms of y/9 since there are nine transmitted signals. 


Consider an M-channel TDM-PPM system with guard time T, = 7 = l/Bz and no 


synchronization marker, so Sp = Mf, A?r. Start with Eq. (5), p. 428, to show that 
(SIN) p < (V8)(BUMWY'S,Cy/M ). 
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his chapter launches our study of digital communication systems. We'll first describe the advantages of digital sys- 

tems over analog systems. VVe then focus on baseband transmission to emphasize the generic concepts and 
problems of digital communication, with or without carrier modulation, and to bring out the differences between dig- 
ital and analog transmission. Following an overview of digital signals and systems, we'll analyze the limitations 
imposed by additive noise and transmission bandwidth. We'll also look at practical design considerations such as 
regenerative repeaters, equalization, and synchronization. 

The obvious question we should ask is: what do we gain with digital systems over analog ones? After all, an 
analog system can be implemented with relatively few components, whereas a digital system requires significantly 
more hardware. For example, a simple analog LPF is implemented with a resistor and capacitor. The equivalent dig- 
ital implementation requires an analogto-digital converter (ADC), digital signal processor (DSP, digitalto-analog con- 


verter [DAC], and an LPF for antialiasing. Nonetheless, despite the apparent increase in hardware complexity, we 
gain the following advantages: 


1. Stability. Digital systems are inherently time invariant. Key system parameters are imbedded in algorithms 


that change only if reprogrammed, making for greater accuracy in signal reproduction. With analog hardware, 

the signal and its parameters are subject to change with component aging, external temperatures, and other 

environmental factors. 

Flexibility. Once digital hardware is in place, we have a great deal of flexibility in changing the system. This 

enables us to employ a multitude of signal processing algorithms to more efficiently (a) improve signal fidelity, 

(b) do error correction/detection for data accuracy, [c] perform encryption for privacy and security, (d) employ 

compression algorithms to remove redundancies, and le] allow for multiplexing of various types of signals such as 

voice, picture, video, text, and so on. Furthermore, an algorithm can be easily and remotely modified. 

3. Reliable reproduction. An analog message traveling through a channel becomes degraded by distor- 
tion and noise. While we can employ amplifiers (repeaters) to boost the signal, they amplify the noise as well 
as the signal and can only increase distortion. Thus distortion becomes cumulative. Making a photocopy of 
another photocopy is an example of this phenomenon. As we will see, with digital communication signal repro- 


duction is extremely reliable whether we employ regenerative repeaters for a long haul digital channel or make 
copies of digital audio recordings. 


It should be noted, however, that we can implement much of an analog communication system using digital hard- 


ware and the appropriate ADC and DAC steps, and thereby secure for an analog system many of the advantages 
of a digital sysiem. 


OBJECTIVES 











After studying this chapter and working the exercises, you should be able to do each of the following: 


l. State the advantages of digital over analog communication. 


2. Identify the format of a digital PAM waveform and calculate the mean and variance of the amplitude sequence 


(Sect. 11.1). 


Find and sketch the power spectrum of a digital waveform with uncorrelated message symbols (Sect. 11.1). 


Sketch the conditional PDFs at a digital regenerator, given the noise PDF, and formulate an expression for the 
error probability (Sect. 11.2). 
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5. Calculate the equivalent bit error probability for an optimum M-ary system with a distortionless channel and 
white noise (Sect. 11.2). 


6. Relate the transmission bandwidth, signaling rate, and bit rate for an M-ary system with Nyquist pulse shaping 
(Sect. 11.3). 





7. Determine appropriate parameters for a digital baseband system to satisfy stated specifications (Sects. 11.2 and 
11.3). 


8. Understand when data scrambling and synchronization are appropriate and how they are accomplished (Sect. 
11.4). 


9, Identify the properties of a maximal-length sequence produced from an n-bit shift register with feedback (Sect. 
11.4). 


10. Determine the output sequence from an n-bit shift register with given set of feedback connections and initial con- 
ditions (Sect. 11.4). 








11.1 DIGITAL SIGNALS AND SYSTEMS 


Fundamentally, a digital message is nothing more than an ordered sequence of 
symbols produced by a discrete information source. The source draws from an 
alphabet of M = 2 different symbols, and produces output symbols at some average 
rate r. For instance, a typical computer terminal has an alphabet of M = 90 sym- 
bols, equal to the number of character keys multiplied by two to account for the shift 
key. When you operate the terminal as fast as you can, you become a discrete infor- 
mation source producing a digital message at a rate of perhaps r ~ 5 symbols per 
second. The computer itself works with just M — 2 internal symbols, represented by 
LOW and HIGH electrical states. We usually associate these two symbols with the 
binary digits O and 1, known as bits for short. Data transfer rates within a computer 
may exceed r = 108. 

The task of a digital communication system is to transfer a digital message from 
the source to the destination. But finite transmission bandwidth sets an upper limit 
to the symbol rate, and noise causes errors to appear in the output message. Thus, 
signaling rate and error probability play roles in digital communication similar to 
those of bandwidth and signal-to-noise ratio in analog communication. As prepara- 
tion for the analysis of signaling rate and error probability, we must first develop the 
description and properties of digital signals. 


Digital PAM Signals 


Digital message representation at baseband commonly takes the form of an amplitude- 
modulated pulse train. We express such signals by writing 


x(t) = D a,p(t — kD) n] 


where the modulating amplitude a, represents the kth symbol in the message 
sequence, so the amplitudes belong to a set of M discrete values. The index k ranges 
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from —oo to too unless otherwise stated. Equation (1) defines a digital PAM sig- 
nal, as distinguished from those rare cases when pulse-duration or pulse-position 
modulation 1s used for digital transmission. 


The unmodulated pulse p(t) may be rectangular or some other shape, subject to 
the conditions 


1 =0 
ys » t= xD, +2D,... e 


This condition ensures that we can recover the message by sampling x(t) periodi- 
cally att = KD, K = 0, +1, +2, .. . , since 


x(KD) = S\a,p(KD — kD) = ag 


The rectangular pulse p(t) = II (t/r) satisfies Eq. (2) if r = D, as does any time- 
limited pulse with p(t) = 0 for |t| = D/2. 

Note that D does not necessarily equal the pulse duration but rather the pulse- 
to-pulse interval or the time allotted to one symbol. Thus, the signaling rate is 


rê 1/D [3a] 


measured in symbols per second or baud. In the special but important case of binary 
signaling (M = 2), we write D = T, for the bit duration and the bit rate ist 


rs = 1/T, [35] 


measured in bits per second, abbreviated bps or b/s. The notation T, and r, will be 
used to identify results that apply only for binary signaling. 

Figure 11.1-1 depicts various PAM formats or line codes for the binary mes- 
sage 10110100, taking rectangular pulses for clarity. The simple on-off waveform in 
part a represents each 0 by an “off” pulse (a, = 0) and each 1 by an “on” pulse with 
amplitude a, = A and duration 73/2 followed by a return to the zero level. We there- 
fore call this a return-to-zero (RZ) format. À nonreturn-to-zero (NRZ) format has 

on" pulses for full bit duration T,, as indicated by the dashed lines. Internal com- 
puter waveforms are usually of this type. The NRZ format puts more energy into 
each pulse, but requires synchronization at the receiver because there's no separa- 
tion between adjacent pulses. 

The unipolar nature of an on-off signal results in a dc component that carries no 
information and wastes power. The polar signal in part b has opposite polarity 
pulses, either RZ or NRZ, so its dc component will be zero if the message contains 
1s and Os in equal proportion. This property also applies to the bipolar signal in part 
c, where successive 1s are represented by pulses with alternating polarity. The bipo- 
lar format, also known as pseudo-trinary or alternate mark inversion (AMT), 


+The more common notation R for bit rate risks confusion with autocorrelation functions and with 
information rate defined in Chap. 16. 


be 
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Figure 11.1-1 Binary PAM formats with rectangular pulses. (a) Unipolar RZ and NRZ; 


(b) polar RZ and NRZ; [c] bipolar NRZ; (d) splitphase Manchester; 
le) polar quaternary NRZ. 


eliminates ambiguities that might be caused by transmission sign inversions—a 
problem characteristic of switched telephone links. 

The split-phase Manchester format in part d represents 1s with a positive half- 
interval pulse followed by a negative half-interval pulse, and vice versa for the rep- 
resentation of Os. This format 1s also called twinned binary. It guarantees zero dc 
component regardless of the message sequence. However, it requires an absolute 
sense of polarity at the receiver. 

Finally, Fig. 11.1-1e shows a quaternary signal derived by grouping the mes- 
sage bits in blocks of two and using four amplitude levels to prepresent the four pos- 
sible combinations 00, 01, 10, and 11. Thus, D = 27, and r = r,/2. Different 
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Table 11.1-1 
ay, Natural Code Gray Code 
3A/2 11 10 
. A2 10 1l 
-A/2 01 01 
—3A/2 00 00 





assignment rules or codes may relate the a, to the grouped message bits. Two such 
codes are listed in Table 11.1—1. The Gray code has advantages relative to noise- 
induced errors because only one bit changes going from level to level. 


Quaternary coding generalizes to M-ary coding in which blocks of n message 
bits are represented by an M-level waveform with 


M = 2" [4a] 


Since each pulse now corresponds to n = log, M bits, the M-ary signaling rate has 
been decreased to 


Tp 
Yr a 
log, M 





[4b] 


But increased signal power would be required to maintain the same spacing between 
amplitude levels. 


Transmission Limitations 


Now consider the linear baseband transmission system diagrammed in Fig. 11.1—2a. 
We’ ll assume for convenience that the transmitting amplifier compensates for the 
transmission loss, and we'll lump any interference together with the additive noise. 


After lowpass filtering to remove out-of-band contaminations, we have the signal- 
plus-noise waveform 


y(t) = dd — tg — kD) + n(t) 


where f, is the transmission delay and p(t) stands for the pulse shape with transmis- 
sion distortion. Figure 11.1—2b illustrates what y(t) might look like when x(f) is the 
unipolar NRZ signal in Fig. 11.1—1a. 

Recovering the digital message from y(t) is the task of the regenerator. An aux- 
iliary synchronization signal may help the regeneration process by identifying the 
optimum sampling times 


tk 7 KD + t4 
If p(0) = | then 


y(ty) = ay + Pja,P(KD — kD)  n(ty) [5] 


k*K 
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Figure 11.1-2 (a] Baseband transmission system; (b) signal-plus-noise waveform. 


whose first term is the desired message information. The last term of Eq. (5) is the 
noise contamination at tg, while the middle term represents cross talk or spillover 
from other signal pulses—a phenomenon given the descriptive name inter-symbol 
interference (ISI). The combined effects of noise and ISI may result in errors in the 
regenerated message. For instance, at the sample time tg indicated in Fig. 11.1-2b 
y(tx) is closer to 0 even though a, = A. 

We know that if n(f) comes from a white-noise source, then its mean square 
value can be reduced by reducing the bandwidth of the LPE. We also know that low- 
pass filtering causes pulses to spread out, which would increase the ISI. Conse- 
quently, a fundamental limitation of digital transmission 1s the relationship between 
ISI, bandwidth, and signaling rate. 


This problem emerged in the early days of telegraphy, iud Harry Nyquist 
(1924, 1928a) first stated the relationship as follows: 





The condition r = 2B agrees with our pulse-resolution rule B = 1/27,,, in Sect. 3.4 
if we require p(t) to have duration 7 = D = l/r. 

The second part of Nyquist’s statement is easily proved by assuming that 
r= 2(B + €) > 2B. Now suppose that the message sequence happens to consist of 
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two symbols alternating forever, such as 101010... . The resulting waveform x(t) then 
is periodic with period 2D = 2/r and contains only the fundamental frequency f; = 
B + eand its harmonics. Since no frequency greater than B gets through the channel, 
the output signal will be zero—aside from a possible but useless dc component. 

Signaling at the maximum rate r — 2B requires a special pulse shape, the sinc 
pulse 


p(t) = sinc rt = sinc t/D [6a] 
having the bandlimited spectrum 
TE di 
PUP) epp I [6b] 


Since P(f) = 0 for |f| > r/2, this pulse suffers no distortion from an ideal lowpass 
frequency response with B = r/2 and we can take r = 2B. Although p(?) is not time- 
limited, it does have periodic zero crossings at t = D, +2D, ..., which satisfies 
Eq. (2). (See Fig. 6.1-6 for an illustration of this property.) Nyquist also derived 
other bandlimited pulses with periodic zero crossings spaced by D > 1/2B sor < 
2B, a topic we set aside to pick up again in Sect. 11.3 after discussing noise and 
errors in Sect. 11.2. 

Meanwhile, note that any real channel needs equalization to approach an ideal 
frequency response. Such equalizers often require experimental adjustment in the 
field because we don't know the channel characteristics exactly. Àn important 
experimental display is the so-called eye pattern, which further clarifies digital 
transmission limitations. 

Consider the distorted but noise-free polar binary signal in Fig. 11.1-3a. When 
displayed on a long-persistence.oscilloscope with appropriate synchronization and 
Sweep time, we get the superposition of successive symbol intervals shown in 
Fig. 11.1—3b. The shape of this display accounts for the name “eye pattern." A dis- 
torted M-ary signal would result in M — 1 "eyes" stacked vertically. 

Figure 11.1—4 represents a generalized binary eye pattern with labels identify- 
ing significant features. The optimum sampling time corresponds to the maximum 
eye opening. ISI at this üme partially closes tbe eye and thereby reduces the noise 


Te 
la) [b] 


Figure 11.1-3 (a) Distorted polar binary signal; [b] eye pattern. 
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Figure 11.1-4 Generalized binary eye pattern. 


margin. If synchronization is derived from the zero crossings, as it usually is, zero- 
crossing distortion produces Jitter and results in nonoptimum sampling times. The 
slope of the eye pattern in the vicinity of the zero crossings indicates the sensitivity 


to timing error. Finally, any nonlinear transmission distortion would reveal itself in 
an asymmetric or “squinted” eye. 





Determine the relation between r and B when p(t) = sinc? at. 


EXERCISE 11.1-1 





Power Spectra of Digital PAM 


The pulse spectrum P(f) = #[p(t)] provides some hints about the power spectrum 
of a digital PAM signal x(f). If p(t) = sinc rt, as a case in point, then P(f) in Eq. 
(6b) implies that G,({f) = 0 for |f| > 7/2. However, detailed knowledge of G,(f) 
provides additional and valuable information relative to digital transmission. 


A simplified random digital wave with p(t) = II(t/D) was tackled in Chap. 9. 
Under the conditions 


ot i=k 
Faas. 
Lese] f it k 
we found that G,(f) = o2 D sinc? fD. Now, substituting P(f) = D sinc fD, we write 


GAP) = SIPP m 


This expression holds for any digital PAM signal with pulse spectrum P(f) when the 

| a, are uncorrelated and have zero mean value. 
But unipolar signal formats have a, # O and, in general, we can't be sure that 
the message source produces uncorrelated symbols. A more realistic approach 
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therefore models the source as a discrete stationary random process. Ensemble 
averages of the a, are then given by the autocorrelation function 


Rn) = E(auai-,] [8] 
analogous to writing R,(r) = E[v(t)u(t — 7)] for a stationary random signal v(7). 


The integers n and k in Eq. (8) reflect the time-discrete nature of a digital sequence. 


If a digital PAM signal x(t) has the pulse spectrum P(f) and amplitude autocor- 
relation R,(n), its power spectrum is 


l ed ; 
GLP) = IPP E Radne m? [9 
Despite the formidable appearance of Eq. (9), it easily reduces to Eq. (7) when 


R((0) = o} and R,(n) = 0 for n # 0. In the case of uncorrelated message symbols 
but a, = m, #0, 


2 2 
oj +m n=0 
R = p : [10] 
a(n) Uu n0 
and 
co oo 

> R,(n)e 7"? = o? zs m? Y e PD 

n=- n--oo 


Then, drawing upon Poisson's sum formula, 


e ] 59 n 
-ep L on 
P p, 2:5 (r 5) 


and therefore 


Gf) = o2 rb) + (mar)? S IP(r fe Cf — nr) mi 


n--oo 


Here we have inserted r — 1/D and used the sampling property of impulse multi- 
plication. 

The important result in Eq. (11) reveals that the power spectrum of x(f) contains 
impulses at harmonics of the signaling rate r, unless m, = 0 or P(f) = 0 at all f= 
nr. Hence, a synchronization signal can be obtained by applying x(t) to a narrow 
BPF (or PLL filter) centered at one of these harmonics. We can also calculate the 
total average power x? by integrating G,( f) over all f. Thus, 


x? =o rE, + (mar) > Pan [12] 
n=—00 
where E, equals the energy in p(t). For Eq. (12), and hereafter, we presume the con- 
ditions in Eq. (10) barring information to the contrary. 
The derivation of Eq. (9) starts with the definition of power spectrum from Eq. 
(7), Sect. 9.2. Specifically, we write 
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GJ) Ê dim = ELIT] 


in which X;(f) is the Fourier transform of a truncated sample function x;(t) = x(t) 


for || < T/2. Next, let T = (2K + 1)D so the limit T — co corresponds to 
K- oo. Then, for K > 1, 


xt) = Apre — kD) 


K 
X(f) — PL —jekD 


and 


LAP eco (er J (aem) 


i=—K 


After interchanging the order of expectation and summation we have 


ELXAN] = IPCOF eC) 
with l 
K 
px(f) = » > E[a,aj]e 7*7? 
k=—K =K 
where E[a,a;] = R,(k — i). 
The double summation for p&( f) can be manipulated into the single sum 


ex) = OK +1) S (1- i JR. (eie 


n--2K 2K + | 





Substituting these expressions in the definition of G,(f) finally gives 


Gf) = jin hoo (Ff) Pox(f) 


= JPOP È Reto 


as stated in Eq. (9). 





Consider the Iola Binary RZ acl in Fie 1L. z la, Gere Pu TI(2r,t) so EXAMPLE 11.1-1 
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A2 
lór, 
0 rp 2ry Sry 4r, f 
Figure 11.1-5 Power spectrum of unipolar binary RZ signal. 


If the source bits are equally likely and statistically independent, then 
a, = A/2, a; = A’/2, and Eq. (10) applies with 


Using Eq. (11) we find the power spectrum to be 
A? f A? oo 

G = inc? + 

MEET age ie 2 


n---oo 





(sine z) e(f — nr,) 


which is sketched in Fig. 11.1—5 for f = 0. Notice the absence of impulses at the 
even harmonics because P(nr,) = 0 when n = +2, *4,.... 


Ly 


We could also, in principle, use Eq. (12) to calculate x2, However, it should be 
evident from the waveform that x? = A?/4 when 1s and Os are equally likely. 





Modify the results of Example 11.1—-1 for p(t) = I(r,t), corresponding to an NRZ 
waveform. In particular, show that the only impulse in G,(f) is at f = 0. 


EXERCISE 11.1-2 





Spectral Shaping by Precoding* 
Precoding refers to operations that cause the statistics of the amplitude sequence a; 
to differ from the statistics of the message sequence. Usually, the purpose of pre- 


coding is to shape the power spectrum via R (n), as distinguished from P(f). To 
bring out the potential for statistical spectral shaping, we rewrite Eq. (9) in the form 


Gf) = reco [euo +2 SRA cos Conn) | [13] 


n=l 


having drawn upon the property R,(—n) = R (n). 
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Now suppose that x(t) is to be transmitted over a channel having poor low- 
frequency response—a voice telephone channel perhaps. With appropriate precod- 
ing, we can force the bracketed term in Eq. (13) to equal zero at f — 0 and thereby 
eliminate any dc component in G,(f), irrespective of the pulse spectrum P(f). The 
bipolar signal format back in Fig. 11.1—1c is, in fact, a precoding technique that 
removes dc content. 

The bipolar signal has three amplitude values, a, = +A, 0, and —A. If 1s and Os 
are equally likely in the message, then the amplitude probabilities are 
P(a, = 0) = 1/2 and P(a, = +A) = P(a, = —A) = 1/4, so the amplitude statis- 
tics differ from the message statistics. Furthermore, the assumption of uncorrelated 
message bits leads to the amplitude correlation 


AU2 n=0 
R fn) =\ —A/4 n=1 [14a] 
0 n=2 
Therefore, 
A? 
Gf) = n|PCOF 3 (1 — cos 2mrf/r,) [145] 


= r| PF) A? sin? mf/r, 
which is sketched in Fig. 11.1—6 taking p(t) = TI(r,t). 
Two other precoding techniques that remove dc content are the split-phase 
Manchester format (Fig. 11.1-14) and the family of high density bipolar codes 
denoted as HDBn. The HDBn scheme is a bipolar code that also eliminates long sig- 


nal “gaps” by substituting a special pulse sequence whenever the source produces n 
successive Os. 


Gf) 
A? 
2r, 


0 r, 2r, 


Figure 11.1-6 Power spectrum of bipolar signal. 
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Sketch G,(f) for a bipolar signal with p(t) = sinc r,t. Then use your sketch to show 
that x? — A?/2. 








11.2 | NOISE AND ERRORS 


Here we investigate noise, errors, and error probabilities in baseband digital trans- 
mission, starting with the binary case and generalizing to M-ary signaling. 

We assume throughout a distortionless channel, so the received signal is free of 
ISI. We also assume additive white noise with zero mean value, independent of the 
signal. (Some of these restrictions will be hfted in the next section.) 


Binary Error Probabilities 


Figure 11.2-1 portrays the operations of a baseband binary receiver. The received 
signal plus noise is applied to a lowpass filter whose transfer function has been 
designed to remove excess noise without introducing ISI. A sample-and-hold (S/H) 
device triggered at the optimum times extracts from y(t) the sample values 


Xt) = a, + n(tj) 


Comparing successive values of y(1,) with a fixed threshold level V completes the 
regeneration process. If y(t,) > V, the comparator goes HIGH to indicate a 1; if 
y(t,) < V, the comparator goes LOW to indicate 0. The regenerator thereby acts as 
an analog-to-digital converter, converting the noisy analog waveform y(t) into a 
noiseless digital signal x,(t) with occasional errors. 

We begin our analysis taking x(f) to be a unipolar signal in which a, = A repre- 
sents the message bit 1 and a, = 0 represents the message bit 0. Intuitively, then, the 
threshold should be set at some intermediate level, 0 < V < A. The regeneration 
process is illustrated by the waveforms in Fig. 11.2-2. Errors occur when a, = O but 
a positive noise excursion results in y(t,) > V, or when a, = A but a negative noise 
excursion results in y(t,) < V. 

To formulate the error probabilities, let the random variable Y represent y(t,) at 
an arbitrary sampling time and let n represent n(t,). The probability density function 


GCF) = Ny2 


Regenerator 





Figure 11.2-1 Boseband binary receiver. 
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of Y obviously involves the noise PDF, but it also depends upon the presence or 
absence of the signal pulse. We therefore need to work with conditional probabili- 


ties. In particular, if H denotes the hypothesis or assumption that a, = 0 and Y = n, 
we can write the conditional PDF 


Py(y|Ho) = puCy) [1a] 


where py(n) is the PDF of the noise alone. Similarly, if H, denotes the hypothesis 
that a; = A and Y = A + n, then 


py(y|Hi) = py(y — A) [15] 
obtained from the linear transformation of p(n) with n = y — A. 


Figure 11.2-3 shows typical curves of py(y|Ho) and py(y|H,) along with a pro- 
posed threshold V. The comparator implements the following decision rule: 








(c) 





(error) (error) 


Figure 11.2-2 Regeneration of a unipolar signal. (a] Signal plus noise; (b) S/H output; 


(c) comparator output. 
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(We ignore the borderline case Y = V whose probability of occurrence will be van- 
ishingly small.) The corresponding regeneration error probabilities are then given by 


Pe = P(Y > V|Hp) = | py(y|Ho) dy [2a] 
V 
À y 
Pac V|H;) = | DyCy]Hi) dy [25] 


equivalent to the shaded areas indicated in Fig. 11.2—3. The area interpretation helps 
bring out the significance of the threshold level when all other factors remain fixed. 
Clearly, lowering the threshold reduces P,, and simultaneously increases P,y. Rais- 
ing the threshold has the opposite effect. 

But an error in digital transmission is an error, regardless of type. Hence, the 
threshold level should be adjusted:to minimize tbe average error probability 


P, = PoPa + PAPA [3a] 

where 
Po = P(Ho) P, = P(H,) [3b] 
which stand for the source digit probabilities. The optimum threshold level V,,, must 


opt 
therefore satisfy dP,/dV = 0, and Leibniz’s rule for differentiating the integrals in 
Eq. (2) leads to the general relation 


Popy(Voy|Ho) = Pi DA Vope|1) [4] 


But we normally expect 1s and Os to be equally likely in a long string of message 
bits, so 


Po =P =} P, = (Po + Pa) [Sa] 
and 
Py(Vope|Ho) = Pí VoH) [5b] 


We'll work hereafter with the equally likely condition, unless stated otherwise. 
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Figure 11.2-3 Conditional PDFs with decision threshold and error probabilities. 
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Closer examination of Eq. (Sb) reveals that V,,, corresponds to the point where 
the conditional PDF curves intersect. Direct confirmation of this conclusion is pro- 
vided by the graphical construction in Fig. 11.24 labeled with four relevant areas, 
a, through o4. The optimum threshold yields P,; = o + @2, Peo = a3, and P, — 
$(a, + a; + o5). A nonoptimum threshold such as V < V,» yields P,, = a, and 
Pao = 05 + œ + ay; thus, P, = (ai + o5 + a3 + a4) = P, + $04 > P, 

Next we make the usual assumption that the noise is gaussian with zero mean 
and variance o7, so 


J 2 2 
pan) = se "fie 
200? 


Substituting this gaussian function into Eqs. (1) and (2) gives 


> Vy 
Po = | Pyly)dy = (=) [6a] 
V 
V 
A-V 
Pa = | PO — A)dy = of = ) [6b] 


where Q is the area under the gaussian tail as previously defined in Fig. 8.4-2. Since 
py(n) has even symmetry, the conditional PDFs py(y|Ho) and py(y|H;,) intersect at 
the midpoint and Vay = A/2 when Py = P, = 1. Furthermore, with V = Vopt in Eq. 
(6), Peo = Pa = Q(A/2c) so the optimum threshold yields equal digit error proba- 
bilities as well as minimizing the net error probability. Thus, P, = (Pa + P4) = 


Poo = Pa and 
P. = E 7 
e o( 4) [7] 


which is the minimum net error probability for binary signaling in gaussian noise 
when the source digits are equally likely. 

Based on Eq. (7), the plot of the Q function in Table T.6 can be interpreted now 
as a plot of P, versus A/2o. This plot reveals that P, drops off dramatically when 





Figure 11.2-4 
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A/2o increases. For instance, P, =~ 2 X 107? at A/2o0 = 2.0 whereas P, ~ 107? at 
A/2o = 6.0. 

Although we derived Eq. (7) for the case of a unipolar signal, it also holds in the 
polar case if a, = +A/2 so the level spacing remains unchanged. The only differ- 
ence at the receiver is that V,,, = 0, midway between the two levels. However, the 
transmitter needs less signal power to achieve a specified level spacing in the polar 
signal. 

Let’s bring out that advantage of polar signaling by expressing A in terms of the 
average received signal power Sp. If we assume equal digit probabilities and more- 
or-less rectangular pulses with full-interval duration T,, then Sp = A?/2 for unipolar 
signaling while S, = A?/4 for polar signaling. (These relations should be obvious 
from the NRZ waveforms in Fig. 11.1—1.) Hence, 


e { V 285 Unipolar (8) 
VAS Polar 


and the factor of V2 can make a considerable difference in the value of P 

Since the noise has zero mean, the variance o? equals the noise power Ng at the 
output of the filter. Therefore, we can write A/2o in terms of the signal-to-noise 
power ratio (S/N), = Sp/Ne, namely 


( A J NT. { 3(S/N)e Unipolar 
20 ANg (S/N)g Polar 


But Eq. (9) conceals the effect of the signaling rate r,. In order to pass pulses of 
duration T, = 1/r,, the noise-limiting filter must have By = r,/2, so 


Ne = NoBy = Nors/2 [10] 


Rapid signaling thus requires more signal power to maintain a given error probabil- 
ity P,. 





[9] 


Suppose a computer produces unipolar pulses at the rate r, = 10° bps = 1 Mbps for 
transmission over a noisy system with No = 4 X 1072 W/Hz. The error rate is spec- 
ified to be no greater than one bit per hour, or P, = 1/3600r, = 3 x 10^ 9. Table 
T.6 indicates that we need A/2o = 6.2, and Eqs. (9) and (10) give the correspond- 
ing signal-power requirement 


A 2 

Saee (£) Ng = 1.5 X 107° = 1.5 pW 
2c 

Clearly, any reasonable signal power ensures almost errorless transmission insofar 

as additive noise is concerned. Hardware glitches and other effects would be the 

limiting factors on system performance. 
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Consider a unipolar system with equally likely digits and (S/N)p = 50. Calculate EXERCISE 11.2-1 
Pap Pep and P, when the threshold is set at the nonoptimum value V = 0.44. Com- 
pare P, with the minimum value from Eq. (7). 








Regenerative Repeaters 


Long-haul transmission requires repeaters, be it for analog or digital communica- 
tion. But unlike analog-message repeaters, digital repeaters can be regenerative. If 
the error probability per repeater is reasonably low and the number of hops m is 
large, the regeneration advantage turns out to be rather spectacular. This will be 
demonstrated for the case of polar binary transmission. 

When analog repeaters are used and Eq. 11, Sect. 9.4 applies, the final signal- 
to-noise ratio is (S/N) pz = (1/m)(S/N), and 


P, = EP [11] 
a= Q m\N ; 


where (S/N), is the signal-to-noise ratio after one hop. Therefore, the transmitted 
power per repeater must be increased linearly with m just to stay even, a factor not 
to be sneezed at since, for example, it takes 100 or more repeaters to cross the con- 
i tinent. The 1/m term in Eq. (11) stems from the fact that the contaminating noise 
progressively builds up from repeater to repeater. 
; In contrast, a regenerative repeater station consists of a complete receiver and 
| transmitter back to back in one package. The receiving portion converts incoming 
signals to message digits, making a few errors in the process; the digits are then 
delivered to the transmitting portion, which in turn generates a new signal for trans- 
| mission to the next station. The regenerated signal is thereby completely stripped of 
random noise but does contain some errors. 


| To analyze the performance, let a be the error probability at each repeater, 
namely, 


assuming identical units. As a given digit passes from station to station, it may suf- 
| fer cumulative conversion errors. Íf the number of erroneous conversions 1s even, 
' they cancel out, and a correct digit is delivered to the destination. (Note that this 1s 

true only for binary digits.) The probability of i errors in m successive conversions is 
| given by the binomial frequency function, 


: PÒ = ("Jai - ey" 


Since we have a destination error only when i 1s odd, 
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P, = Pki) = (jaa -— ay + ("Jar - ay. = ma 


i odd 


where the approximation applies for œ << 1 and m not too large. Hence, 


P, mQ (=) [12] 
en VAN, 


so P, increases linearly with m, which generally requires a much smaller power 
increase to counteract than Eq. (11). 

Figure 11.2-5 illustrates the power saving provided by regeneration as a 
function of m, the error probability being fixed at P, = 107°. Thus, for example, a 
10-station nonregenerative baseband system requires about 8.5 dB more transmitted 
power (per repeater) than a regenerative system. 


Matched Filtering 


Every baseband digital receiver—whether at the destination or part of a regenerative 
repeater—should include a lowpass filter designed to remove excess noise without 
introducing ISI. But what’s the optimum filter design for this purpose? For the case 
of timelimited pulses in white noise, the answer is a matched filter. We'll pursue 
that case here and develop the resulting minimum error probability for binary sig- 
naling in white noise. 


Let the received signal be a single timelimited pulse of duration 7 centered at 
time t = kD, so 


x(t) = a,p(t — kD) 


where p(0) = 1, p(t) = 0 for |t| > 7/2, and T = D. Maximizing the output ratio 
(a/o Y at time tp = kD + t, will minimize the error probability. As we learned in 
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Figure 11.2-5 Power saving gained by m regenerative repeaters when P, = 1075. 
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Sect. 9.5, this maximization calls for a matched filter whose impulse response is 
proportional to p(t, — t). In particular, we take 


1 
h(t) = ptg t) [13a] 
Teg 
with 
fue | pXt)dt tg = 7/2 [13b] 


The delay ty = 7/2 is the minimum value that yields a causal impulse response, and 
the proportionality constant 1/7, has been chosen so that the peak output amplitude 
equals a,. The parameter Teq can be interpreted from the property that aT. equals 
the energy of the pulse x(?). 


In absence of noise, the resulting output pulse is y(t) = A(t) * x(t), with peak 
value y(t,) = a, as desired. This peak occurs 7/2 seconds after the peak of x(t). 
Thus, matched filtering introduces an unavoidable delay. However, it does not intro- 
duce ISI at the sampling times for adjacent pulses since y(t) = 0 outside of t, + 7. 
Figure 11.2-6 illustrates these points taking a rectangular shape for p(t), in which 


case Teg = T and y(t) has a triangular shape. 
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Figure 11.2-6 Matched filtering with rectangular pulses. {a} Received pulse; [b] impulse 
response; (c) output pulse. 
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When x(t) is accompanied by white noise, the output noise power from the 
matched filter will be 


0 


2 N p 9 
Ng = o° == IHCF) P df 


wf 2 No 
-—| a(t)? dt = —— [14] 
25) s 2 dn 


This result agrees with the lower bound in Eq. (10) since, for binary signaling, 
Teq = Ty = 1/ry. We'll use this result to evaluate the maximum value of (A/20)? 
and the corresponding minimum binary error probability when the noise 1s white 
and gaussian and the receiver has an optimum filter matched to the timelimited 
pulse shape. 

Consider a binary transmission system with rate r,, average received power Sa, 


and noise density No. We can characterize this system in terms of two new parame- 
ters E, and y, defined by 


E, £ Syr, [15a] 


Yo 2 Sr/Nors = Ey/ No [15b] 


The quantity E, corresponds to the average energy per bit, while y, represents the 


ratio of bit energy to noise density. If the signal consists of timelimited pulses p(s) 
with amplitude sequence a,, then 


oo 
—00 
where a; = A?/2 for a unipolar signal or at = A?/4 for a polar signal. Thus, since 
the output noise power from a matched filter is o? = No/214, we have 


(A/20)? = f E,/ No = y, Unipolar 
2E, No = 2y, Polar 
and Eq. (7) becomes 


pz mo Unipolar f6] 
° Q(V2y,) Polar 


This is the minimum possible error probability, attainable only with matched filtering. 

Finally, we should give some attention to the implementation of a matched fil- 
ter described by Eq. (13). The impulse response for an arbitrary p(r) can be approx- 
imated with passive circuit elements, but considerable design effort must be 
expended to get h(t) ^ O for t > r. Otherwise, the filter may produce significant 
ISI. For a rectangular pulse shape, you can use an active circuit such as the one dia- 
grammed in Fig. 11.2-7a, called an integrate-and-dump filter. The op-amp inte- 
grator integrates each incoming pulse so that y(t,) = a, at the end of the pulse, after 
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x(t) 


y(t) 





(b) 


Figure 11.277 Integrate-and-dump filter. (a) Op-amp circuit; (b) polar M-ary waveforms. 


which the dump switch resets the integrator to zero—thereby ensuring no ISI at sub- 
sequent sampling times. The integrate-and-dump filter is probably the best practical 


implementation of matched filtering. Figure 11.2—7b illustrates its operation with a 
polar M-ary waveform. 


Let x(t) be the unipolar RZ signal in Fig. 11.1-1a. (a) Sketch the corresponding out- 
put waveform from a matched filter and from an integrate-and-dump filter. (b) Con- 


firm that matched filtering yields (A/20)? = y, even though o? = Nor, so 





EXERCISE 11.2-2 









M-ary Error Probabilities 


Binary signaling provides the greatest immunity to noise for a given S/N because it 
has only two amplitude levels—and you can’t send information with fewer than two 
" levels. Multilevel M-ary signaling requires more signal power but less transmission 
| bandwidth because the signaling rate will be smaller than the bit rate of an equiva- 
lent binary signal. Consequently, M-ary signaling suits applications such as digital 
transmission over voice channels where the available bandwidth is limited and the 
signal-to-noise ratio is relatively large. 

Here we calculate M-ary error probabilities in zero-mean gaussian noise. We'll 


take the most common case of polar signaling with an even number of equispaced 
levels at 


a, = tA/2, *3A/2 ,..., *(M — 1)A/2 [17] 
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We'll also assume equally likely M-ary symbols, so that 


P E +P + +P ) [18] 
e M €o € a ey-1 


which is the M-ary version of Eq. (5a). 

Figure 11.2-8 shows the conditional PDFs for a quaternary (M = 4) polar sig- 
nal plus gaussian noise. The decision rule for regeneration now involves three 
threshold levels, indicated in the figure at y = —A, 0, and +A. These are the opti- 
mum thresholds for minimizing P,, but they do not result in equal error probabilities 
for all symbols. For the two extreme levels at a, = +3A/2 we get 


P = E: = Q(A/20) 
whereas 
P, = Pa, = 2Q(A/20) 


because both positive and negative noise excursions produce errors for the inner lev- 
els ata, = +A/2. The resulting average error probability is 


1 A 3 A 
= — X — = —— areal 
Fe 4 so( 4) o( 4) 


or 50 percent greater than binary signaling with the same level spacing. 


The foregoing analysis readily generalizes to an arbitrary even value of M with 
M — 1 decision thresholds at 


M-2 
2 





y=0, +A, + 2A,..., + A [19] 


Then P,, = P, | = Q(A/2o) while the M — 2 inner levels have doubled error prob- 
ability, yielding the average error probability 
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Figure 11.2-8 Conditional PDFs for a quaternary polar signal with gaussian noise. 
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Duc PE x (£) + (M =- 2) x x) [20] 
€ M OO 2 20 


2M — 2 A 1 A 
v M o( +) E ( - ie) 


Equation (20) clearly reduces to Eq. (7) when M = 2, whereas P, = 20(A/2o) 
when M >> 2. 


Next, we relate A/2o to the signal power and noise density assuming a time- 
limited pulse shape p(t) so the average energy per M-ary digit is E, = a? Tq Where 


Tu = | p*(t) dt 


—Cco 


as before. If the M amplitude levels are equally likely and given by Eq. (17), then 


— ] 42 A 2 M2 — 1 
Pix 2 - 1) = 2 [21] 
> (i (4) oo" 





Hence, since Sp = rEy, 


ANS 3 Eg 
"uat ees ^ [22] 
2c M =] Ti^ 


3 1 Sp 6 Sp 
= 5 = 5 
M* — 1 TT eq Ng M —1 Nor. 











where the upper bound corresponds to Ng = No/27,, obtained with matched filter- 
ing. Equations (20) and (22) constitute our final result for error probability in a polar 
M-ary system with gaussian white noise. 

More often than not, M-ary signaling is used to transmit binary messages and 
the value of M is selected by the system engineer to best fit the available channel. 
We should therefore investigate the design considerations in the selection of M 
especially the impact on error probability. But Eqs. (20) and (22) fail to tell the full 
story for two reasons: first, the M-ary signaling rate differs from the bit rate 7,; sec- 
ond, the M-ary error probability differs from the bit error probability. 

We can easily account for the signaling-rate difference when the message bits 
are encoded in blocks of length log; M. Then r, and r are related by 


Epes FE log, M [23] 


from Eq. (4), Sect. 11.1. To relate the M-ary symbol error probabihty P, to the 
resulting error probability per bit, we'll assume a Gray code and a reasonably large 
signal-to-noise ratio. Under these conditions a noise excursion seldom goes beyond 
one amplitude level in the M-ary waveform, which corresponds to just one erro- 
neous bit in the block of log; M bits. Therefore, 


Pre = P,/log, M [24] 
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where P,, stands for the equivalent bit error probability, also called the bit error 
rate (BER). 


Combining Eqs. (23) and (24) with our previous M-ary expressions, we finally 
have 





M-1 A 
P,, ~2 25 
x rr E) [25a] 


in which 





(4) < Ss [255] 
20) ) Mi-1Ny M-i” 


Notice that the upper bound with matched filtering has been written in terms of 
Ye = Sp/Nor, = Sp/(Nor log; M). This facilitates the study of M-ary signaling as a 
function of energy per message bit. 


Suppose the channel in question has a fixed signaling rate 7 = 3000 baud = 3 
kbaud and a fixed signal-to-noise ratio (S/N)p = 400 ~ 26 dB. (These values 
would be typical of a voice telephone channel, for instance.) We'll assume matched 
filtering of NRZ rectangular pulses, so r7,4 = 1 and 





AW 3 6log, M 
2)])= SiS 
(+) a (S/N) = Sey, 


which follow from Eqs. (22) and (25b). 

Binary signaling yields a vanishingly small error probability when 
(S/N), = 400, but at the rather slow rate r, = r = 3 kbps. M-ary signaling increases 
the bit rate, per Eq. (23), but the error probability also increases because the spacing 
between amplitude levels gets smaller when you increase M with the signal power 


held fixed. Table 11.2—1 brings out the trade-off between bit rate and error probabil- 
ity for this channel. 














Table 11.2-1 M-ary signaling with r= 3 kilobaud and (S/N), = 400 
M ry (kbps) ANO P, 
2 3 20.0 3 x 1078 
4 6 8.9 1 x 1075 
8 9 4.4 4 X 1075 
16 12 22 7 X 1073 
32 15 1.1 6 X 107? 
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Table 11.2-2 Mary signaling with rj = 9 kbps and Phe = 4 x 10-75 
M r (kbaud) Ys 
2 9.00 10 
4 4.50 24 
8 3.00 67 
16 2:25 200 
32 1.80 620 





Another type of trade-off is illustrated by Table 11.2-2, where the bit rate and 
error probability are both held fixed. Increasing M then yields a lower signaling rate 
r—implying a smaller transmission bandwidth requirement. However, the energy 
per bit must now be increased to keep the error probability unchanged. Observe that 
going from M = 2 to M = 32 reduces r by 1/5 but increases y, by more than a fac- 


tor of 60. This type of trade-off will be reconsidered from the broader viewpoint of 
information theory in Chap. 16. 





Consider the three-level bipolar binary format in Fig. 11.1—1c with amplitude prob- 
abilities P(a, = 0) = 1/2 and P(a, = +A) = Pla, = —A) = 1/4. Make a sketch 
similar to Fig. 11.2-8 and find P, in terms of A and o when the decision thresholds 
are at y = +A/2. Then calculate Sp and express P, in a form like Eq. (16). 


EXERCISE 11.2-3 














| 11.3 BANDLIMITED DIGITAL PAM SYSTEMS 


This section develops design procedures for baseband digital systems when the 
transmission channel imposes a bandwidth limitation. By this we mean that the 
| available transmission bandwidth is not large compared to the desired signaling rate 
and, consequently, rectangular signaling pulses would be severely distorted. Instead, 
we must use bandlimited pulses specially shaped to avoid ISI. 
| Accordingly, we begin with Nyquist’s strategy for bandlimited pulse shaping. 
Then we consider the optimum terminal filters needed to minimize error probability. 
The assumption that the noise has a gaussian distribution with zero mean value will 
| be continued, but we'll allow an arbitrary noise power spectrum. We'll also make 
ij allowance for linear transmission distortion, which leads to the subject of equaliza- 
tion for digital systems. The section closes with an introductory discussion of cor- 
| relative coding techniques that increase the signaling rate on a bandlimited channel. 


Nyquist Pulse Shaping 


Our presentation of Nyquist pulse shaping will be couched in general terms of M-ary 
signaling with M = 2 and symbol interval D = 1/r. In order to focus on potential ISI 
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problems at the receiver, we'll let p(t) be the pulse shape at the output of the receiv- 
ing filter. Again assuming that the transmitter gain compensates for transmission loss, 
the output waveform in absence of noise is 


y(t) = 2,2 — ta — kD) 


As before, we want p(t) to have the property 
l t=0 
t) = 
Pte) lo PSA De ve 


which eliminates ISI. Now we impose the additional requirement that the pulse 
spectrum be bandlimited, such that 


P(f) = 0 "ES: T 

where i 

B-l-cg Qs 
2 2 


This spectral requirement permits signaling at the rate 
r-—J2(B-B) Bsr=2B [2] 


in which B may be interpreted as the minimum required transmission bandwidth, so 
that By = B. 


Nyquist’s vestigial-symmetry theorem states that Eq. (1) is satisfied if p(t) has 
the form 


P(t) = pglt) sinc rt [3a] 
with 
F[palt)] = Pef) = 0 MIB (3bl 
p(0) = | Pols) a= 


Clearly, p(t) has the time-domain properties of Eq. (1a). It also has the frequency- 
domain properties of Eq. (1b) since 


PF) = PS) «(Q/7)(7r)] 

and the convolution of two bandlimited spectra produces a new bandlimited spectrum 
whose bandwidth equals the sum of the bandwidths, namely, B = B + r/2. Usually 
we take p,(z) to be an even function so P,(f) is real and even; then P(f) has vestig- 
ial symmetry around f = +r/2, like the symmetry of a vestigial sideband filter. 

Infinitely many functions satisfy Nyquist’s conditions, including the case when 
pelt) = 1 so B = O and p(t) = sinc rt, as in Eq. (6), Sect. 11.1. We know that this 
pulse shape allows bandlimited signaling at the maximum rate r = 2B. However, 
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synchronization turns out to be a very touchy matter because the pulse shape falls 


off no faster than 1/|t| as |t| — co. Consequently, a small timing error e results in 
the sample value 


y(ty) = ag sinc re + Ma, sinc (KD — kD + re) 
k*K 
and the ISI in the second term can be quite large. 
Synchronization problems are eased by reducing the signaling rate and using 
pulses with a cosine rolloff spectrum. Specifically, if 


M ane E (Z f ) 
PKf) = geo S 28 II 2B [4a] 
then 
l r 
E Here 
1 r r 
=< > —(|f| - = —— —+ 8 4 
POF) = A c eos lll Penes (abi 
T 
0 ie oe 
and the corresponding pulse shape is 
cos 2mpt . 
———À 5 
p(t) L— (4B) sinc rt [5] 


Plots of P(f) and p(t) are shown in Fig. 11.3~1 for two values of B along with B = 
0. When f > 0, the spectrum has a smooth rolloff and the leading and trailing oscil- 
lations of p(t) decay more rapidly than those of sinc rt. 


E B = r2 (100%) 


B = rl4 (50%) 
p=0 







Figure 11.3-1 Nyquist pulse shaping. (a) Spectra; (b) waveforms. 
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Figure 11.3-2 Baseband waveform for 10110100 using Nyquist pulses with B = r/2. 


Further consideration of Eqs. (4) and (5) reveals two other helpful properties of 


p(t) in the special case when B = r/2, known as 100 percent rolloff. The spectrum 
then reduces to the raised cosine shape 


— 1 saf = ES ols nf < 
P(f) = 7 cos Zath ] cos (Z£) | lflsr [6a] 
and 
sinc 2rt 
pa) = TL any (65) 


The half-amplitude width of this pulse exactly equals the symbol interval D, that is, 
p(+0.5 D) = 1/2, and there are additional zero crossings att = +1.5D,+2.5D,.... 
A polar signal constructed with this pulse shape will therefore have zero crossings 
precisely halfway between the pulse centers whenever the amplitude changes polar- 
ity. Figure 11.3-2 illustrates this feature with the binary message 10110100. These 
zero crossings make it a simple task to extract a synchronization signal for timing 
purposes at the receiver. However, the penalty is a 50 percent reduction of signaling 
speed since r = B rather than 2B. Nyquist proved that the pulse shape defined by Eq. 
(6) is the only one possessing all of the aforementioned properties. 


Sketch P(f) and find p(t) for the Nyquist pulse generated by taking 





Pf) = (2/r) A(2f/r). Compare your results with Eq. (6). 


Optimum Terminal Filters 


Having abandoned rectangular pulses, we must likewise abandon the conventional 
matched filter and reconsider the design of the optimum receiving filter that mini- 


mizes error probability. This turns out to be a relatively straightforward problem 
under the following reasonable conditions: 


1. The signal format is polar, and the amplitudes a, are uncorrelated and equally 
likely. 





T D. 
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GG 
| Sr | 
x(t) Ar(f) HcCf) o HKF) EG) 
= 


Figure 11.3-3 


2. "The transmission channel is linear but not necessarily distortionless. 


The filtered output pulse p(Z) is to be Nyquist shaped. 


Go 


The noise is additive and has a zero-mean gaussian distribution but may have a 
nonwhite power spectrum. 


To allow for possible channel distortion and/or nonwhite noise, our optimization 
must involve filters at both the transmitter and receiver. As a bonus, the source wave- 
form x(t) may have a more-or-less arbitrary pulse shape p,(2). 

Figure 11.3-3 lays out the system diagram, including a transmitting filter func- 


tion H4(f), a channel function H-(f), and a receiving filter function H,(f). The 
input signal has the form 


x(t) = X, apt — kD) [Zal] 
k 
and its power spectrum is 
GAF) = oarlP.(f)/ U bi 
| where P(f) = [p,(6)] and 
i. 2 


12 
| These relations follow from Eq. (12), Sect. 11.1, and Eq. (21), Sect. 11.2, with our 
stated conditions on a,. Thus, the transmitted signal power will be 


2.2. oo 
Sr = | IH CF) o.) df = x] IH F)PAG) P df [8] 


| a result we'll need shortly. 
At the output of the receiving filter we want the input pulse p,(t) to produce a 


Nyquist-shaped pulse p(t — tz), where tg represents any transmission time delay. 
The transfer functions in Fig. 11.3—3 must therefore obey the relationship 


PAA)AAS AAP) Half) = P(f)e 7" [9] 
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so both terminal filters help shape p(t). But only the receiving filter controls the out- 
put noise power 


Ng 2 c? = | AS) GC) df [10] 


where G,( f) is the noise power spectrum at the input to the receiver. 

Equations (7)-(10) constitute the information relevant to our design problem. 
Specifically, since the error probability decreases as A/2o increases, we seek the 
terminal filters that maximize (A/20 )? subject to two constraints: (1) the transmitted 
power must be held fixed at some specified value Sy, and (2) the filter transfer func- 
tions must satisfy Eq. (9). 

We incorporate Eq. (9) and temporarily eliminate H,(f) by writing 





P(f 
Ol = REDE 
Then we use Eqs. (8) and (10) to express (4/20 )? as 
A WV 35; 1 
ae (M? = 1)r las = 
where 
m | VARGA) d CO af tab 


Maximizing (A/20 )? thus boils down to minimizing the product of integrals Jp, in 
which H;(/) is the only function under our control. 

Now observe that Eq. (12b) has the form of the right-hand side of Schwarz's 
inequality as stated in Eq. 17, Sect. 3.6. The minimum value of Jp therefore occurs 


when the two integrands are proportional. Consequently, the optimum receiving fil- 
ter has 


sIPC) aaa 
VGAF)IAAF)| 


where g is an arbitrary gain constant. Equation (11) then gives the optimum trans- 
mitting filter characteristic 


[Half ) |? = 


DOPO) 
aPC PACA 


These expressions specify the optimum amplitude ratios for the terminal filters. 
Note that the receiving filter deemphasizes those frequencies where G,(f) is large, 
and the transmitting filter supplies the corresponding preemphasis. The phase shifts 
are arbitrary, providing that they satisfy Eq. (9). 

Substituting Eq. (13) into Eq. (12) yields our final result 


IP) 





[135] 
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[14] 


A 85 | li IPCOTV GF) “| 


20 max 7 (M* E l)r —oo lIH-(f)| 


from which the error probability can be calculated using Eq. (20), Sect. 11.2. As a 
check of Eq. (14), take the case of white noise with G,(f) = No/2 and a distortion- 
less channel with transmission loss L so |E f) = 1/L; then 


CONI ED | PL] 


—oco 


But since S;/L = S4 and Nyquist-shaped pulses have 


[pula 


we thus obtain 








(4) | 6 Sgp  6log;M 
2e]... M?—-1Nr Mi-1 


which confirms that the optimum terminal filters yield the same upper bound as 
matched filtering—see Eqs. (22) and (25), Sect. 11.2. 





Consider a system with white noise, transmission loss L, and a distortionless chan- EXAMPLE 11.3-1 
nel response over |f| = Br where B; = r. This transmission bandwidth allows us to 

use the pulse shape p(t) in Eq. (6), thereby simplifying synchronization. Simplicity 

also suggests using the rectangular input pulse p,(t) = Il(t/r) with r = 1/r, 

so P,(f) = 7 sinc fr. Taking the gain constant g in Eq. (13) such that |H,(0)| = 1, 

we have 


l Tf cos (mf/2r) 
— — = ee < 
MKf)-cs ES VL ger 
as plotted in Fig. 11.3—4. Notice the slight high-frequency rise in |H{f)| compared 
to |H(f) |. If the input pulses have a small duration r << 1/r, then this rise becomes 
negligible and |H;(f)| ~ |H&(f)], so one circuit design serves for both filters. 






Hz) Xx 2 
TM VE 


IHR) 
Figure 11.3-4 

Amplitude ratio of optimum 
filters in Example 11.3-1. 0 
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EXERCISE 11.3-2 Carry out the details going from Eq. (12) to Eqs. (13) and (14). 





Equalization 


Regardless of which particular pulse shape has been chosen, some amount of resid- 
ual ISI inevitably remains in the output signal as a result of imperfect filter design, 
incomplete knowledge of the channel characteristics, and so on. Hence, an 
adjustable equalizing filter is often inserted between the receiving filter and the 
regenerator. These “mop-up” equalizers usually have the structure of a transversal 
filter previously discussed in Sect. 3.2 relative to linear distortion of analog signals. 
However, mop-up equalization of digital signals involves different design strategies 
that deserve attention here. 

Figure 11.3—5 shows a transversal equalizer with 2N + 1 taps and total delay 
2ND. The distorted pulse shape p(t) at the input to the equalizer is assumed to have 
its peak at t = 0 and ISI on both sides. The equalized output pulse will be 

N 
Peg(t) = eP — nD — ND) 


new-— 


and sampling at t, = kD + ND yields 


N 


N 
Pelt) = 2 Cn PKD — nD) = YXoBP,, [15] 
n=—N N 


n-— 


where we've introduced the shorthand notation p,_ = p|(k — n)D]. Equation (15) 
thus takes the form of a discrete convolution. 


Total delay 2ND 
AK 


o_O ———À 


eet ie 
© (+) (+ }——> Peal) 


Figure 11.3-5 Transversal equalizer with 2N + | taps. 
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Ideally, we would like the equalizer to eliminate all ISI, resulting in 


1 k=0 
Peg te) z 0 kÆ0 


But this cannot be achieved, in general, because the 2N + 1 tap gains are the only 
variables at our disposal. We might settle instead for choosing the tap gains such that 





(t,) = { : E [16] 
i 0 k = £1, +2,..., EN 
thereby forcing N zero values on each side of the peak of p,,(t). The corresponding 
| tap gains are computed from Eqs. (15) and (16) combined in the matrix equation 
Po ty D C.N 0 
p N-1 P weg || emt 0 
| Py e Py Ge de [17] 
Py P wu e 0 
Dus xe Pp Cy 0 


Equation (17) describes a zero-forcing equalizer. This equalization strategy is opti- 
mum in the sense that it minimizes the peak intersymbol interference, and it has the 
| added advantage of simplicity. Other optimization criteria lead to different strategies 
with more complicated tap-gain relationships. 
When the transmission channel is a switched telephone link or a radio link with 
| slowly changing conditions, the values of p, will not be known in advance. The tap 
gains must then be adjusted on-line using a training sequence transmitted before the 
actual message sequence. An adaptive equalizer incorporates a microprocessor for 
| automatic rather than manual tap-gain adjustment. More sophisticated versions 
adjust themselves continuously using error measures derived from the message 
sequence. Further details regarding adaptive equalization are given by Haykin 
| (2001, Sect. 4.1) and Proakis (2001, Chap. 11). 
Mop-up equalization, whether fixed or adaptive, does have one hidden catch in 
that the equalizer somewhat increases the noise power at the input to the regenerator. 
| This increase is usually more than compensated for by the reduction of ISI. 


Suppose a three-tap zero forcing equalizer is to be designed for the distorted pulse EXAMPLE 11.3-2 
plotted in Fig. 11.3-6a. Inserting the values of D, into Eq. (17) with N = 1, we have 





—— 
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1.0 0.1 0.0 || c, 0 
2. 0.2 1 0 O. 1 Co = 1 
0.1 -02 E Ee 0 
Therefore, 


c,-7-—0096  c-096 4-202 


and the corresponding sample values of p,,(t) are plotted in Fig. 11.3-6b with an 
interpolated curve. As expected, there is one zero on each side of the peak. However, 
zero forcing has produced some small ISI at points further out where the unequal- 
ized pulse was zero. 


p(t) 





Figure 11.3-6 (a) Distorted pulse; (b) equalized pulse. 





Correlative Coding * 


Correlative coding, also known as partial-response signaling, is a strategy for band- 
limited transmission at r = 2B that avoids the problems associated with p(t) = sinc rt. 


Las dake 
E 





ds EA us E E — 
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The strategy involves two key functional operations, correlative filtering and digital 
precoding. Correlative filtering purposely introduces controlled intersymbol interfer- 
ence, resulting in a pulse train with an increased number of amplitude levels and a cor- 
related amplitude sequence. Nyquist's signaling-rate limitation no longer applies 
because the correlated symbols are not independent, and therefore higher signaling 
rates are possible. Digital precoding of the message sequence before waveform gener- 
ation facilitates message recovery from the correlative-filtered pulse train. 

Figure 11.3—7a shows the general model of a transmission system with correla- 
tive coding, omitting noise. The digital precoder takes the sequence of message 
symbols m, and produces a coded sequence m, applied to an impulse generator. 
(In practice, the impulses would be replaced by short rectangular pulses with dura- 
tion T << D.) The resulting input signal is an impulse train 


x(t) = M a,6(t — kD) 


whose weights a, represent the mg. The terminal filters and channel have the overall 
transfer function H( f), producing the output waveform 


y(t) = Ma,h(t — kD) [18] 


where h(t) = ¥7'[A(f)]. 

Although a correlative filter does not appear as a distinct block in Fig. 11.3—7a, 
the transfer function H(f) must be equivalent to that of Fig. 11.3-7b—which con- 
sists of a transversal filter and an ideal LPF. The transversal filter has total delay ND 












Digital 
precoder 









Impulse 
generator 


H(f)- x 
HC PAAR) 





Regen 





la) 
Total ee ae ND 


tes 


{b} 







Ideal LPF 
B = rl2 


Figure 11.3-7 (o) Transmission system with correlative coding; [b] equivalent correlative filter. 
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and N + | tap gains. Since the impulse response of the LPF is sinc rt andr = 1/D, 
the cascade combination yields the overall impulse response 


N 
A(t) = X c,sinc (rt — n) [19] 
n=O 
Hence, Eq. (18) becomes 
N 
y(t) = Sa c, Sine. (rt — n — k) - [20a] 
k n-0 


= Ma,sinc (rt — k) 


k 
where 


N 
ak £ Cody + cya, 4 + c + Cyayy = param [20b] 
n=0 
Message regeneration must then be based on the sample values y(t,) = a;. 
Equation (20) brings out the fact that correlative filtering changes the amplitude 
sequence a, into the modified sequence a,. We say that this sequence has a correla- 
tion span of N symbols, since each a; depends on the previous N values of a,. 
Furthermore, when the a, sequence has M levels, the ag sequence has M' > M lev- 
els. To demonstrate that these properties of correlative filtering lead to practical 
bandlimited transmission at r — 2B, we must look at a specific case. 
Duobinary signaling is the simplest type of correlative coding, having M — 2, 


N = 1, and cy = c, = 1. The equivalent correlative filter is diagrammed in Fig. 11.3-8 
along with its impulse response 


h(t) = sinc r,t + sinc (r,t — 1) [214] 


and the magnitude of the transfer function 
2 mf .. 
H(f) = 2. cos he thn \f| = n/2 [215] 
rp Tb ; 


The smooth rolloff of H(f) is similar to the spectrum of a Nyquist-shaped pulse and 
can be synthesized to a good approximation. But, unlike Nyquist pulse shaping, 
duobinary signaling achieves this rolloff without increasing the bandwidth require- 
ment above B = r,/2. In exchange for the signaling rate advantage, a duobinary 
waveform has intentional ISI and M' = 3 levels—attributed to the property of the 
impulse response that A(t) = 1 at both £ = O and t = T}. 


To bring out the ISI effect, let the amplitude sequence a, be related to the pre- 
coded binary message sequence m, by 


+ A/2 m, = 1 


[22] 
— A/2 my 


a, = (m, — 1/2)A = { 


ll 
o 








b 
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Delay 
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2/r, 





-ry2 0 ry2 i 
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Figure 11.3-8 Duobinary signaling. (a) Equivalent correlative filter; (b) impulse response; 


(c) amplitude ratio. 


which is equivalent to a polar binary format with level spacing A. Equation (20) then 
gives the corresponding output levels 


y(t) = a, = a, + a, 4, = (my, + my, — 1)A [23a] 


+A m,-m,,71 


—A m, 0 


Il 
ll 


L 
Py, 


In principle, you could use Eq. (23b) to recover mj from y(t,) if you have previously 
recovered mg. However, when noise causes an erroneous value of m; .,, all subse- 


quent message bits will be in error until the next noise-induced error—a phenome- 
non called error propagation. 
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The digital precoder for duobinary signaling shown in Fig. 11.3—9a prevents 
error propagation and makes it possible to recover the input message sequence m, 
from y(t,). The precoder consists of an exclusive-OR gate with feedback through a 
D-type flip-flop. Figure 11.3—9b lists the coding truth table along with the algebraic 
sum m, + m; that appears in Eq. (23a). Substitution now yields 


+A m, = 0 
y(t.) = { 0 x 4 [24] 
k 


which does not involve m,_, thanks to the precoder. When y(?) includes additive 
gaussian white noise, the appropriate decision rule for message regeneration is: 





This rule is easily implemented with a rectifier and a single decision threshold set at 
A/2. Optimum terminal filter design gives the minimum error probabihty 


3 = 
Bs- o{ ZV 2) [25] 


which is somewhat higher than that of a polar binary system. 

When the transmission channel has poor dc response, modified duobinary sig- 
naling may be employed. The correlative filter has N = 2, cg = 1, c, = 0, and 
c = — 1, so that 


2j 27r 
AA) = sin s fien 6l 
b b 


Figure 11.3-10 shows |H(f)| and the block diagram of the correlative filter. The 
precoder takes the form of Fig. 11.3-9 with two flip-flops in series to feed back 


Exclusive-OR 
gate 


D 
= 


D-type 
flip-flop 


my, 





, 
Mk-i 





la] (b) 
Figure 11.3-9 (aj Digital precoder for duobinary signaling; (b) truth table. 
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Figure 11.3-10 Correlative filter for modified duobinary signaling. {a} Amplitude ratio of 
correlative filter; (b) block diagram. 


my 5. If the pulse generation is in accordance with Eq. (22), then 


y(&) = a, — ay; = (m, — mA | R 


2 0 m=0 
+A m= 1 


which can be compared with Eqs. (23) and (24). 





Consider a duobinary system that uses the precoding of Fig. 11.3—9 and has binary 
message m, = 000010011000. For the purposes of initial conditions, we set the ini- 
tial values of my_, and aj. , to 0 and —A, respectively. Table 11.3—1 shows the mes- 
sage input, m,; the precoded output, m,; the input to the correlative filter, a, with 
A = 2; and finally the correlative filter output, y(t,). Note that if we apply the rule of 
Eq. (24), we will recover the original message from y(t,) > m,. Now let's consider 
the modified duobinary system of Fig. 11.3-10 that includes precoding with two 
flip-flops. Here we set the initial values of m;. ; and a,_, to 0 and —A, respectively. 
The results are shown in Table 11.3-2. 


Table 11.3-1 Duobinary signaling example with precoding 





m, 0 0 0 0 1 0 0 1 P 0 0 0 
mia 0 0 0 0 0 1 1 1 0 1 1 1 
mj 0 0 0 0 l 1 1 0 Er 4 1 1 
a, -1 -1 -l -l 1 1 1 0-1 EE: 1 1 
Qa -1 epo ~] -1 1 l |) -1 1 1 1 
i -2 -2 -2 -2 0 2 2 0 0 2 2 2 
rity 0 0 0 0 1 0 0 1 1 0 0 0 


EXAMPLE 11.3-3 
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Table 11.3-2 Modified duobinary signaling example with precoding 











m 0 0 0 0 1 0 0 1 1 cB 0 0 
mia 0 0 0 0 0 1 0 1 1 0 1 
mi 0 0 0 1 0 1 1 0 1 0 1 
à, - =b =~ =] 1 -l 1 1 -1 1 -1 1 
dj -1 =l 1 1 1 1 1 -1 1 1 j^ - a 
xt) -2 -2 -2 ~2 0 -2 2 0 0 2 -2 2 
Pu 0 0 0 0 1 0 0 1 01 0 0 0 











Construct a truth table like Fig. 11.3—9b for the case of modified duobinary signal- 
ing and use it to obtain Eq. (27). 














11.4 SYNCHRONIZATION TECHNIQUES 


Synchronization is the art of making clocks tick together. The clocks in a digital com- 
munication system are at the transmitter and receiver, and allowance must be made 
for the transmission time delay between them. Besides symbol synchronization, most 
systems also require frame synchronization to identify the start of a message or vari- 
ous subdivisions within the message sequence. Additionally, carrier synchronization 
is essential for digital transmission with coherent carrier modulation—a topic to be 
discussed in Chap. 14. 

Here we'll consider symbol and frame synchronization in baseband binary sys- 
tems. Our attention will be focused on extracting synchronization from the received 
signal itself, rather than using an auxiliary sync signal. By way of an overview, 
Fig. 11.4-1 illustrates the position of the bit synchronizer relative to the clock and 
regenerator. Framing information is usually derived from the regenerated message and 
the clock, as indicated by the location of the frame synchronizer. We’ ll look at typi- 
cal synchronization techniques, along with the related topics of shift-register opera- 
tions for message scrambling and pseudonoise (PN) sequences for framing purposes. 

Our coverage of synchronization will be primarily descriptive and illustrative. 
Detailed treatments of digital synchronization with additive noise are given by Men- 


gali and D’ Andrea (1997). 
Output message 
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Figure 11.4-1 Synchronization in a binary receiver. 
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Bit Synchronization 


Bit-sync generation becomes almost trivial when y(t) has unipolar RZ format so its 
power spectrum G,( f) includes 6(f + r,), like Fig. 11.1—5. A PLL or narrow BPF 
tuned to f = r, will then extract a sinusoid proportional to cos (27r,t + $), and 
phase adjustment yields a sync signal for the clock. The same technique works with 
a polar format if y(t) is first processed by a square-law device, as diagrammed in 
Fig. 11.4-2a. The resulting unipolar waveform y(t) shown in Fig. 11.4-2b now 
has the desired sinusoidal component at f = r,. Various other nonlinear polar-to- 
unipolar operations on y(t) achieve like results in open-loop bit synchronizers. 

But a closed-loop configuration that incorporates the clock in a feedback loop 
provides more reliable synchronization. Figure 11.4—3 gives the diagram and 
explanatory waveforms for a representative closed-loop bit synchronizer. Here a 
zero-crossing detector generates a rectangular pulse with half-bit duration 7,/2 
starting at each zero-crossing in y(t). The pulsed waveform z(t) then multiplies the 
square-wave clock signal c(f) coming back from the voltage-controlled clock 
(VCC). The control voltage v(f) is obtained by integrating and lowpass-filtering the 
product z(f)c(t). The loop reaches steady-state conditions when the edges of c(t) and 
z(t) are synchronized and offset by 7,/4, so the product has zero area and u(t) 
remains constant. Practical implementations of this system usually feature digital 
components in place of the analog multiplier and integrator. 

Both of the foregoing techniques work best when the zero-crossings of y(t) are 
spaced by integer multiples of T,. Otherwise, the synchronization will suffer from 


cos (2m7,t + $) 


y(t) y(t) BPF Phase To clock 
fo^ To adj 


{a} 


ei — c ME ec 
j T 
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cos 2arryt 
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Figure 11.4-2 Bit synchronization by polor to unipolar conversion. (c) Block diogram; 


(b} waveforms. 
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xu) Zero crossing 
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Figure 11.4-3 Closed-loop bit synchronization with a voltage-controlled clock. (a) Block dia- 


gram; (b) waveforms. 


timing jitter. An additional problem arises if the message includes a long string of 1s 
or Os, so y(t) has no zero-crossings, and synchronism may be lost. Message scram- 
blers discussed shortly help alleviate this problem. 

A. different approach to synchronization, independent of zero-crossings, relies 
on the fact that a properly filtered digital signal has peaks at the optimum sampling 


times and is reasonably symmetric on either side. Thus, if t, is synchronized and 
8 < T,/2, then 


Ixte — 6)| ~ y(t, + 5)| < |y(z,)| 


However, a late sync signal produces the situation shown in Fig. 11.4-4a where 
ly(t, — 5)| > |y(, + 8), while an early sync signal would result in 
| y(t, — 8)| < | y(t, + 8)|. The early-late synchronizer in Fig. 11.4-4b uses these 
properties to develop the control voltage for a VCC in a feedback loop. A late sync 


signal results in v(t) = | y(t, — 8)| — | y(t, + 8)| > 0, which speeds up the clock, 
and conversely for an early sync signal. 
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Figure 11.4-4 Early-late bit synchronization. [ad] Waveform; [b] block diagram. 


| Scramblers and PN Sequence Generators 


Scrambling is a coding operation applied to the message at the transmitter that 
| "randomizes" the bit stream, eliminating long strings of like bits that might impair 
receiver synchronization. Scrambling also eliminates most periodic bit patterns that 
could produce undesirable discrete-frequency components (including dc) in the 
| power spectrum. Needless to say, the scrambled sequence must be unscrambled at 
the receiver so as to preserve overall Dit sequence transparency. 
Simple but effective scramblers and unscramblers are built from tapped shift reg- 
| isters having the generic form of Fig. 11.4—5, the digital counterpart of a tapped delay 
line. Successive bits from the binary input sequence b, enter the register and shift 
from one stage to the next at each tick of the clock. The output b; is formed by com- 
bining the bits in the register through a set of tap gains and mod-2 adders, yielding 


b, = abr- D oob, O ^ O o bi, 
The tap gains themselves are binary digits, so a, = 1 simply means a direct connec- 


tion while a, = 0 means no connection. The symbol © stands for modulo-2 addi- 
tion, defined by the properties 


[1] 
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Figure 11.4-5 Tapped shift register. 
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Figure 11.4-6 (a) Binary scrambler; [b] unscrambler. 3 
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and 
b, Bb, © b, = (b, 5) © b = b, DB (b; by) [2b] 


where b,, b, and b, are arbitrary binary digits. Mod-2 addition is implemented with 
exclusive-OR gates, and obeys the rules of ordinary addition except that 101 = 0. 
Figure 11.4—6 shows an illustrative scrambler and unscrambler, each employing 
a 4-stage shift register with tap gains o, = a, = 0 and o4 = a, = 1. (For clarity, 
we omit the clock line here and henceforth.) The binary message sequence m, at 
the input to the scrambler is mod-2 added to the register output m; to form the 


scrambled message m, which is also fed back to the register input. Thus, 
m, = m, 4,Q m, ,and 
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m, = m,Q m, [3a] 


The unscrambler has essentially the reverse structure of the scrambler and repro- 
duces the original message sequence, since 


m,O mi = (m mpm, [3b] 
= m, (m; Q m;) = m, D0 = m 


Equations (3a) and (3b) hold for any shift-register configuration as long as the 
scrambler and unscrambler have identical registers. 

The scrambling action does, of course, depend on the shift-register configura- 
tion. Table 11.4—1 portrays the scrambling produced by our illustrative scrambler 
when the initial state of the register 1s all Os. Note that the string of nine Os in m, has 
been eliminated in mg. Nonetheless, there may be some specific message sequence 
that will result in a long string of like bits in m. Of more serious concern is error 
propagation at the unscrambler, since one erroneous bit in m, will cause several out- 
put bit errors. Error propagation stops when the unscrambler register is full of cor- 
rect bits. 

Next, in preparation for the subsequent discussion of frame synchronization, we 
consider shift register sequence generation. When a shift register has a nonzero ini- 
tial state and the output is fed back to the input, the unit acts as a periodic sequence 
generator. Figure 11.4—7 shows an illustrative sequence generator using a five-stage 


shift register where the second and fifth cells are tapped and mod-2 added, and the 
result is fed back to the first stage so 


m, =m ms [4] 


and the sequence output is ms. For shorthand purposes this can also be referred to as 
a [5, 2] configuration. If the initial state of the register is 11111, it then produces a 


Table 11.4-1 














my 0 1 0 1 0 1 | 1 1 0 0 0 1 0 
Registers mia 0 0 1 0 dou 1 1 del 1 0 0 0 1 
Contents my 3 0 0 0 1 0 1 0 l 1 1l 1 0 0 0 
Mg -4 0 0 0 0 1 0 1 0. I. 4 1 1 0 0 
Register 
Output my 0 0 0 | 1 1 1 1 0 0 0 1 0 0 
Input 
Sequence m, 1 0 1 1 0 0 0 0 0 0 O0 0 0 1 
Output 


Sequence m; 1 0 i 0 ] l 1 1 0 0 0 1 0 ! 
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31-bit sequence 1111100110100100001010111011000, which repeats periodically 
thereafter. In general 


N=2"- 1 [5] 


If the appropriate feedback tap connections are made, then an n-bit register can 
produce a maximal-length (ml) sequence per Eq. (5). (Figure 11.4-7 is, in fact, an 
ml sequence generator with n — 5 and N — 31.) An ml sequence has the following 
properties: 


1. Balance. The number of ones generated is one more than the number of zeros 
generated. 


2. Run. Arun is a sequence of a single type of digit. An ml sequence will have 
one-half of its runs of length 1, one-quarter of its runs of length 2, one-eighth of 
its runs of length 3, and so on. 


3. Autocorrelation. Its autocorrelation function has properties similar to the cor- 
relation properties of random noise, in that there is a single autocorrelation peak. 

4, ‘The mod-2 addition of an ml sequence and any shifted version of it results in a 
shifted version of the original sequence. 


5. Except for the zero state, all of the 2" possible states will exist during the 
sequence generation. 


Maximal-length sequences are all also called pseudonoise (PN) sequences. The 
name pseudonoise comes from the correlation properties of PN sequences. To 
develop this point, let a PN sequence s, be used to form a binary polar NRZ signal 


s(t) = >) Qs, — 1)p(t — kT.) [6] 
k 
where p(t) is a rectangular pulse and the amplitude of the kth pulse is 
+1s5,=1 
t)=2s,—1= 7 
c(t) Sk l ee: [7] 


The signal s(t) is deterministic and periodic, with period NT,, and has a periodic 
autocorrelation function given by 


RÀT) = [(N 1) A(7/T,) — 1]/N |r| € NT,/2 [8] 
which is plotted in Fig 11.4—8. If N is very large and 7, very small, then 

















m, m» I: ma E Output sequence 
l l 1 l ] 
Figure 11.4-7 2 111010... 
Shift register sequence | 
generator with [5, 2] O 
configuration. 
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R(T) = Têt) — 1/N |r| = NT,/2 [9] 


so the PN signal acts essentially like white noise with a small dc component. This 
noiselike correlation property leads to practical applications in test instruments and 
radar ranging, spread spectrum communication (described in Chap. 15) and digital 
framing. When we do use PN generators to operate on a data stream, to avoid sig- 
nificant dc content in the PN sequence and minimize information loss to the signal 


we are operating on, we use the function c(t) instead of the polar s, for our scram- 
bling sequence. 





A 3-bit, [3, 1] shift register configuration with values of 111 produces a periodic ml 
sequence of 1110100 with N — 7. A simple way used by Dixon (1994) to calculate 
the autocorrelation function for the comparable c(t) sequence is to compare the orig- 
inal s, sequence to each of its N — 1 shifted versions (7 = 0 to 67.). Then, for each 
pair, let v(7) be the difference between the number of bit matches and mismatches, 


so the autocorrelation function is Rp, (7) = v(r)/N and hence the results shown in 
Table 11.4—2. 


EXAMPLE 11.4-1 








Table 11.4-2 

T Original/shifted u(r) Rg, (T) = vGYN 

0 1110100 7.00 1.00 
1110100 

1 1110100 — 1.00 —0.14 
0111010 

2 1110100 —1.00 —0.14 
0011101 

3 1110100 —1.00 —0.14 
1001110 

4 1110100 —1.00 —0.14 

: 0100111 

3 1110100 —1.00 —0.14 
1010011 

6 1110100 —1.00 —0.14 
1101001 

0 1110100 7.00 1.00 
1110100 


If plotted, the autocorrelation function would 1ook like the one in Figure 11.4-8 


with N = 7 and T, =.1. From this, we observe our [3, 1] shift register configuration 
produces an ml sequence. 





Show that the output sequence from a 5-stage, [5, 4, 3, 2] register with initial condi- EXERCISE 11.4-1 
tions 11111 is 1111100100110000101101010001110 and has period 31. 


EE NT TENUIS 
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Figure 11.4-8 Autocorrelation of a PN sequence. 


Prefix Start of message 











Message bits 





N-bit sync word cd] 


Figure 11.4-9 


Frame Synchronization 


A. digital receiver needs to know when a signal is present. Otherwise, the input 
noise alone may produce random output bits that could be mistaken for a message. 
Therefore, identifying the start of a message is one aspect of frame synchroniza- 
tion. Another aspect is identifying subdivisions or frames within the message. 'To 
facilitate frame synchronization, binary transmission usually includes special N-bit 
sync words as represented in Fig. 11.4-9. The initial prefix consists of several rep- 
etitions of the sync word, which marks the beginning of transmission and allows 
time for bit-sync acquisition. The prefix is followed by a different codeword label- 
ing the start of the message itself. Frames are labeled by sync words inserted peri- 
odically in the bit stream. 

The elementary frame synchronizer in Fig. 11.4—10 is designed to detect a sync 


Word 5,55 .. . sy Whenever it appears in the regenerated sequence m,. Output bits 
with the polar format 


Qro = 2m, —1= £1 
are loaded into an N-stage polar shift register having polar tap gains given by 
6c 2SN+1—i E 1 [10] 


This awkward-looking expression simply states that the gains equal the sync-word 
bits in polar form and reverse order, that is, c; = 2sy — 1 while cy = 2s, — 1. The 
tap-gain outputs are summed algebraically to form 
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N-stage polar 
shift register 


Polar 
tap gains 


Figure 11.4-10 Frame synchronizer. 





Frame 
indicator 


N 
U, = M cti D 
i=] 


This voltage is compared with a threshold voltage V, and the frame-sync indicator 
goes HIGH when v, > V. 

If the register word is identical to the sync word, then a, =.c¢; so 
cd, ; = C? = 1 and v, = N. If the register word differs from the sync word in just 
one bit, then v, = N — 2 (why?). Setting the threshold voltage V slightly below N — 
2 thus allows detection of error-free sync words and sync words with one bit error. 
Sync words with two or more errors go undetected, but that should be an unlikely 
event with any reasonable value of P,. False frame indication occurs when N or N — 
] successive message bits match the sync-word bits. The probability of this event is 


N N-1 
Py = (i) + (=) =3 x270 [12] 
ff 5 2 


assuming equally likely 1s and Os in the bit stream. 

Further examination of Eqs. (10) and (11) reveals that the frame synchronizer 
calculates the cross-correlation between the bit stream passing through the register 
and the sync word, represented by the tap gains. The correlation properties of a PN 
sequence therefore make it an ideal choice for the sync word. In particular, suppose 
the prefix consists of several periods of a PN sequence. As the prefix passes through 
the frame-sync register, the values of v, will trace out the shape of R (r) in Fig. 11.4-8 
with peaks v, — N occurring each time the initial bit s, reaches the end of the register. 
An added advantage is the ease of PN sequence generation at the transmitter, even 
with large values of N. For instance, getting Pj, < 1074 in Eq. (12) requires N > 14.8, 
which can be accomplished with a 4-stage PN generator. 
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11.5 PROBLEMS 


Sketch x(f) and construct the corresponding eye pattern (without transmission dis- 
tortion) for binary PAM with the data sequence 1011100010 when the signal has a 
unipolar format and pulse shape p(t) = cos? (wt/2T,)I1(¢/2T,). 


Do Prob. 11.1-1 with p(t) = cos (zt/2T,)1I(t/2T,). 

Do Prob. 11.1-1 with a polar format and p(t) = cos (wt/2T,) 1] (t/2T,). 
Do Prob. 11.1-1 with a bipolar format and p(t) = A(t/T,). 

Do Prob. 11.1-1 with a bipolar format. 

Modify Table 11.1~1 for an octal signal (M = 8). 


A certain computer generates binary words, each consisting of 16 bits, at the rate 
20,000 words per second. (a) Find the bandwidth required to transmit the output as 
a binary PAM signal. (b) Find M so that the output could be transmitted as an M-ary 
signal on a channel having B = 60 kHz. 


A certain digital tape reader produces 3000 symbols per second, and there are 128 
different symbols. (a) Find the bandwidth required to transmit the output as a binary 


PAM signal. (b) Find M so that the output could be transmitted as an M-ary signal on 
a telephone link having B = 3 kHz. 


Binary data is transmitted as a unipolar signal with A = 1 and p(t) = u(t + T,) — 
u(t). The transmission system's step response is g(t) = Ky(1 — e "u(t), where 
b = 2/T,. (a) Sketch P(t) and find K, such that p(0) = 1. (b) Sketch y(t) for the 
data sequence 10110 and evaluate y(t) and the ISI at the optimum sampling times. 


Do Prob. 11.1-9 for a polar signal with A/2 = 1 and a transmission system having 
b = 1/T,. 

Consider digital transmission with a gaussian pulse shape p(t) = exp [—7(bt)*], 
which is neither timelimited nor bandlimited and does not have periodic zero cross- 
ings. Let p(kD) = 0.01 for k # 0, to limit the ISI, and let the bandwidth B be such 
that P(f) = 0.01P(0) for |f| > B. Find the resulting relationship between r and B. 
Find and sketch the power spectrum of a binary PAM signal with polar RZ format and 
rectangular pulses, assuming independent and equiprobable message bits. Then show 
that the time-domain and frequency-domain calculations of x? are in agreement. 
Consider a unipolar binary PAM signal whose pulse shape is the raised cosine in 
Example 2.5-2 with total duration 27 = T,. (a) Sketch x(t) for the data sequence 
10110100. Then find and sketch the power spectrum, assuming independent 
equiprobable message bits. (b) Repeat part a with 27 = 27;,,. 


Find and sketch the power spectrum of a polar binary PAM signal with 
p(t) = sinc r, t when the message bits are independent but have unequal probabili- 
ties, say a and 1 — a. Use your sketch to show that x? is independent of a. 
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11.1-16 
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Find and plot the power spectrum of a binary signal with the split-phase Manchester 
format in Fig. 11.1—1c assuming independent equiprobable message bits. Compare 
your result with Fig. 11.1-6. 


Verify the correlation properties of bipolar precoding given in Eq. (14a) by tabulat- 
ing the possible sequences a,.,a, and their probabilities for n = 0,n = 1, and 
n = 2. Assume independent equiprobable message bits. 


Let g(n) be a function of the discrete variable n. By expanding and rearranging the 
summation, show that 





> dee-)=@K+1) »0- P d 


k--K i=—K Ec 2K + 1 


Apply this result to carry out the manipulation of the sum p,{f) used to derive 


Gf) from EL IX COP]. 


Consider a binary signal constructed using different pulse shapes p,(t) and po(t) to 
represent 1s and Os. For purposes of analysis, we can write 


x(t) = D [aun(t — kT) + but — kT;)] 
k 
where a, equals the kth bit and b, = 1 — a,. (a) Assuming independent and equi- 
probable bits, show that a, = b, = aj = b; = 1/2, a, b, = 0, and a, a; = b, b; = 
a,b; = 1/4 for i#k. (b) Now form the truncated signal x(t) with 
T= (2K + 1)T, => T, and show that 


EB GP] = SP) - Por 








K K 


+ HP) PERE S. pede 


k--K i--K 
where P,(f) = &[pj(r)] and Po(f) = &[pe(1)]. (c) Finally, use the relationship in 
Prob. 11.1—17 and Poisson's sum formula to obtain 


r 


Gf) = P0) - POP 


a 
1 


[IPi(nry) — Po(nr,) PF — nr,) 


il 


Em 
ws 


n---oo 


Find (S/N), such that a unipolar binary system with AWGN has P, = 0.001. What 
would be the error probability of a polar system with the same (S/N),? 


A binary system has AWGN with Ny = 1078/r,. Find Sẹ for polar and unipolar sig- 
naling so that P, = 107°. 
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Some switching circuits generate impulse noise, which can be modeled as filtered 
white noise with average power o? and an exponential PDF 


Eu e sg Vae 
207 
(a) Develop an expression for P, in terms of A and c for polar binary signaling con- 


taminated by impulse noise. (b) Compare the effect of impulse noise with that of 
gaussian noise by considering the condition P, = 0.001. 


Consider a polar binary system with ISI and AWGN such that y(t,) = a, + 
€, + n(t,), where the ISI e, is equally likely to be +g or —a@. (a) Develop an expres- 
sion for P, in terms of A, a, and ø. (b) Evaluate P, when A = 80 anda = 0.1 A. 
Compare your result with the error probability without ISI. 


Do Prob. 11.2-4 taking e, = +a, 0, and —a with probabilities 0.25, 0.5, and 0.25, 
respectively. 


Use Eq. (4) to obtain an expression for the optimum threshold in a polar binary sys- 
tem with AWGN when P, # Pj. 


Derive Eq. (4) using Leibniz's rule, which states that 


a [+2 db(z) 
p (ZA) dA = giz, bl) 
d IE EM BUERNE dz 





a We) 
CAD) a. | = [ae A)] aa 
az) ^ 


where z is an independent variable and a(z), b(z), and g(z, A) are arbitrary functions. 


A polar binary system has 20 repeaters, each with (S/N), = 20 dB. Find P, when 
the repeaters are regenerative and nonregenerative. 


A polar binary system with 50 repeaters is to have P, = 107^. Find (S/N), in dB 
when the repeaters are regenerative and nonregenerative. 


Consider the split-phase Manchester format in Fig. 11.1-1(d), where 


1 = T,/2<t<0 
p(t) = 
-1 0< t< T,/2 


Plot the matched filter's impulse response. Then use superposition to plot the pulse 
response a, p(t — to) * h(t) and compare with Fig. 11.2-6(c). 


Consider a unipolar RZ binary system with p(t) = u(t) — u(t — T,/2). Instead of a 
matched filter, the receiver has a first-order LPF with impulse response 
h(t) = Kg e u(t) where Kg = b/(1 — e 7%"). (a) Find and sketch Ap(t) » h(t) 
and obtain the condition on b such that the ISI at any subsequent sampling time does 


not exceed 0.14. (b) Show that (A/20)* = (4br,/K3)y, = 0.812y,, where the 
upper bound comes from the ISI condition. ` 
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A binary data transmission system is to have r, = 500 kbps and P,e = 10 ^. The 
noise is white gaussian with Ny = 10^" W/Hz. Find the minimum value of Sg 
when: (a) M = 2; (D) M = 8 with Gray coding. 

Suppose the transmission bandwidth of the system in Prob. 11.2-12 is B = 80 kHz. 


Find the smallest allowed value of M and the corresponding minimum value of Sp, 
assuming a Gray code. 


A binary data transmission system has AWGN, y, = 100, and Gray coding. What's 
the largest value of M that yields Pj, = 107°? 
Derive the result stated in Eq. (21). 


Suppose binary data is converted to M = 4 levels via the Gray code in Table 11.1-1. 


Use Fig. 11.2-8 to derive an expression for P, in terms of k = A/2o. Simplify your 
result by assuming k > 1. 


Do Prob. 11.2-16 with the natural code in Table 11.1-1. 


Use Eq. (3a) to find and sketch p(t) and P(f) when P&(f) = (1/28)TI(f/2B) with 
B = r[4. 
Use Eq. (3a) to find and sketch p£) and P(f) when P,(f) = (1/28)11(//2B) with 
B = rf3. 
Suppose B = 3 kHz. Using pulses with a cosine rolloff spectrum, what is the maxi- 


mum baud rate if the rolloffs are: (a) 100 percent, (b) 50 percent, (c) 25 percent? 


Given a binary message of 10110100, sketch the baseband waveform using 50 per- 
cent rolloff raised cosine pulses. How does this compare with Figure 11.3—2? 


We want to transmit binary data at 56 kbps using pulses with a cosine rolloff spec- 
trum. What is B for rolloffs of: (a) 100 percent, (b) 50 percent, (c) 25 percent? 
Obtain Eq. (6b) from Eq. (5) with B = r/2. Then show that p(+D/2) = 1/2. 
Carry out the details leading to Eqs. (4b) and (5) starting from Eq. (4a). 


A more general form of Nyquist’s signaling theorem states that if P(f) = #[ p(t) | 
and 


ee edu — co «f € oco 


n-—-—oo 


then p(t) has the property in Eq. (1a) with D = 1/r. (a) Prove this theorem by tak- 
ing the Fourier transform of both sides of 


p(t) » é(t — kD) = > p(kD)8(t — kD) 


k--—oo --—o09 


Then use Poisson's sum formula. (b) Use a sketch to show that P( f) in Eq. (4b) sat- 
isfies the foregoing condition. 
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Consider an arbitrary bandlimited pulse spectrum with P(f) = 0 for |f| = B. Use 
simple sketches to obtain the additional requirements on P(f) needed to satisfy 
Nyquist's signaling theorem as stated in Prob. 11.3-5 when: (a) r/2 < B < r; (b) 
B = r[2; (c) B < r/2. 


A binary data system is to be designed for r, = 600 kbps and P,, = 10 ?. The wave- 
form will have M = 2”, Gray coding, and Nyquist pulse shaping. The noise is white 
gaussian with Ny = 1pW/Hz. The transmission channel has loss L = 50 dB and is 
distortionless over the allocated bandwidth B = 200 kHz. Choose M to minimize the 
transmitted power, and find the resulting values of r, B, and Sz. 


Do Prob. 11.3-10 with B = 120 kHz. 
Do Prob. 11.3-10 with B = 80 kHz. 


Consider a data transmission system with M = 2, r = 20,000, p,(t) = II(2rt), 
IH-(f)| = 0.01, G,(f) = 1079(1 + 3 x 107*|£|)?, and p(s) per Eq. (6b). (a) Find 
and sketch the amplitude ratio for the optimum terminal filters. (b) Calculate $7 
needed to get P, = 1076, assuming gaussian noise. 


Consider a data transmission system with M = 4,r = 100, p,(t) = II(10rt), 
G,(f) = 1079, | Hif)? = 107$/(1 + 32 x 107f?), and p(t) = sinc rt. (a) Find 
and sketch the amplitude ratio for the optimum terminal filters. (b) Calculate S; 
needed to get P, = 1076, assuming gaussian noise. 


Consider a polar system in the form of Fig. 11.33 with G,(f) = No/2 and a time- 
limited input pulse shape p,(t). Let Hf) = K and H«(f) = [P(f)H&f)]*e 7", 
so the receiving filter is matched to the received pulse shape. Since this scheme does 
not shape the output pulse p(t — t,) for zero ISI, it can be used only when the dura- 
tion of p,(t) is small compared to I/r. (a) Obtain an expression for K such that 


p(0) = 1. Then develop expressions for Sy and c? to show that (A/2c)’ is given by 
Eq. (12a) with 





Nol (PLP df 
Iu E —= '7 
2 | IPC) He C£) P af 


(b) Show that this result is equivalent to the maximum value (A/20)? = 
6SR/(M? — 1)Nor. 


A certain system has been built with optimum terminal filters for Nyquist pulse 
shaping, assuming white noise and a distortionless channel with loss L. However, it 
turns out that the channel actually introduces some linear distortion, so an equalizer 
with Half) = (V Lu« f)] ! has been added at the output. Since the terminal fil- 
ters were not modified to account for the channel distortion, the system has less than 
optimum performance. (a) Obtain expressions for S; and c? to show that 
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(b) Evaluate K in dB when P(f) is as given by Eq. (6a) and Hc(f)-— 
(Vp agr 

Find the tap gains for a three-tap zero-forcing equalizer when p., — 04, 
Po = 1.0, P, = 02, and P, = 0 for |k| > 1. Then find and plot p,,(t,). 

Obtain expressions for the tap gains of a three-tap zero-forcing equalizer when 
P-1 = €, Po = 1, Dı = ô, and p, = O for |k| > 1. 

Find the tap gains for a five-tap zero-forcing equalizer for p(t) in Fig. 11.3-6a. (You 


can solve the simultaneous equations by successive substitution.) Then find the 
resulting values of p,,(t,) and compare with Fig. 11.3—6b. 


Consider binary correlative coding with N = 2, cg = 1, c, = 2, and c; = 1. (a) Find 
and sketch A(t) and |H (f£). (b) Use Eqs. (20b) and (22) to develop an expression for 
y(t,) like Eq. (23b). 


Do Prob. 11.3220 with N = 4,cg = 1,c = 0,c2 = — 2,0, = 0, ande, = 1. 





Consider a duobinary system with H(f) as given by Eq. (21b), p,(t) = 6(t), 
H-(f) = M VL, and gaussian white noise. Obtain an expression for (A/2c)” and 
apply Schwarz’s inequality to show that the optimum filters have 
IHsC£) |? = gL|A(f)| and |Ag(f)|? = |HC£)|/g. Then derive Eq. Q5). 


Consider a duobinary system with an input data sequence of 101011101 and A = 2. 
(a) Find the coder output and then verify the receiver will have the correct output. 
(b) Calculate the dc value of the coder's output. (c) What is the receiver's output if 
the third bit has a value of zero? 


Do Prob. 11.3-23 with a precoder. 
Do Prob. 11.3-23 with modified duobinary signaling and precoding. 


Given a five-stage shift register scrambler/unscrambler system with m; = 
mj. D m, 4 D m,- and zero initial shift register conditions, compute the scram- 
bled output and the output from the corresponding unscrambler for an input 


sequence of m, — 011111101110111. What are the dc levels for the unscrambled 
and scrambled bit streams? 


Suppose we have a five-stage shift register sequence generator with a [5, 4, 3, 2] 
configuration, and its contents are all initially ones. Determine the output sequence, 


its length, and plot the corresponding autocorrelation function. Is the output an ml 
sequence? 


Do Prob. 11.4—2 using a shift register with a [4, 2] configuration. 
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H aving studied the basic concepts of digital transmission in Chap. 11, we return once more io anolog communi- 
cation. But now we consider digital transmission of analog messages via coded pulse modulation. Coded pulse 
modulation systems employ sampling, quantizing, and coding to convert analog waveforms into digital signals. 

Digital coding of analog information produces a rugged signal wiih a high degree of immunity to transmission 
distortion, interference, and noise. Digital coding also allows the use of regenerative repeaiers for long-distance anc- 
log communication. However, the quantizing process essential for digital coding results in quantization noise which 
becomes the fundamental limitation on waveform reconstruction. To keep the quantization noise small enough for suit 
able fidelity, a coded puise modulation system generally requires a much larger bandwidth than a comparable ana- 
log transmission system. 

We'll develop these properties first in conjunction with pulse-code modulation [PCM]. Next, we describe 
delta modulation (DM) and other schemes that involve predictive coding. To better appreciate the advantages 
of digital over analog systems we will look at digital audio recording using the audio compact disk (CD). We 
then consider digital multiplexing, a valuable technique that makes it possible to combine analog and digital 
information for transmission in the form of a multiplexed digital signal. Finally, the chapter closes with a brief discus- 


sion of computer networks by considering the Open Systems Interconnection |OSI) and the Transmission 
Control Protocol/Internet Protocol [TCP/IP} sysiems. 


OBJECTIVES 





After studying this chapter and working the exercises, you should be able to do each of the following: 


1. Define and relate the parameters of a PCM system, and distinguish between quantization noise and random noise 


(Sect. 12.1). 

2. Find the conditions for PCM transmission above threshold, and calculate the value of (S/N); (Sects. 12.1 and 
12.2). 

3. Identify and compare the distinctive features and relative advantages of PCM (with and without companding), 
delta modulation, and differential PCM (Sects. 12.2 and 12.3). 

4. Describe the operation of a compact disk digital audio system, how it achieves error control, and its advantages 
over analog systems (Sect. 12.4). 

5. 


Diagram a digital multiplexing system that accommodates both analog and digital signals in a standard multi- 
plexing hierarchy including the North American and CCIT (Sect. 12.5). 


6. Explain the concepts of the Integrated Services Digital Nework (ISDN) and Synchronous Optical Network 
(SONET) hardware multiplexing schemes (Sect. 12.5). 


Describe the concepts of packet switching, frame relay, and asynchronous transfer mode (ATM) data switching 
schemes (Sect. 12.5). 


8. Explain the OSI and TCP/IP computer network architectures (Sect. 12.6). 
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12.1 PULSE-CODE MODULATION 


This section describes the functional operation of pulse-code modulation (PCM) 





log-to-digital converter’ DC} af the input and a 


When the digital error probability is sufficiently small, PCM performance as an 
analog communication system depends primarily on the quantization noise intro- 
duced by the ADC. Here we'll analyze analog message reconstruction with quanti- 


zation noise, temporarily deferring to the next section the effects of random noise 
and digital errors. 


PCM Generation and Reconstruction 


Figure 12.1—1a diagrams the functional blocks of a PCM generation system. The 
analog input waveform x(t) is lowpass filtered and sampled to obtain x(kT,). A 
quantizer rounds off the sample values to the nearest discrete value in a set of q 
quantum levels. The resulting quantized samples x,(kT,) are discrete in time (by 
virtue of sampling) and discrete in amplitude (by virtue of quantizing). 

To display the relationship between x(kT,) and x,(AT,), let the analog message 
be a voltage waveform normalized such that lx(f)| = 1 V. Uniform quantiza- 
tion subdivides the 2-V peak-to-peak range into q equal steps of height 2/g V, 
as shown in Fig. 12.1-1b. The quantum levels are then taken to be at 
+1/q, =3/q,..., (q — 1)/q in the usual case when q is an even integer. A quan- 
tized value such as x,(kT,) = 5/q corresponds to any sample value in the range 
4/q < x(kT,) < 6/q. 

Next, an encoder translates the quantized samples into digital code words. The 
encoder works with M-ary digits and produces for each sample a codeword consist- 
ing of v digits in parallel. Since there are M" possible M-ary codewords with v dig- 
its per word, unique encoding of the q different quantum levels requires that M" = q. 
The parameters M, v, and q should be chosen to satisfy the equality, so that 


q = M” v = logyg 1] 
Thus, the number of quantum levels for binary PCM equals some power of 2, 
namely g = 2”. 


Finally, successive codewords are read out serially to constitute the PCM wave- 
form, an M-ary digital signal. The PCM generator thereby acts as an ADC, perform- 
ing analog-to-digital conversions at the sampling rate f, = l/T, A timing circuit 
coordinates the sampling and parallel-to-serial readout. 
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(c) PCM generation system; (b) quantization characteristic. 


Each encoded sample is represented by a v-digit output word, so the signaling 


rate becomes r = vf, with f, = 2W. Therefore, the bandwidth needed for PCM 
baseband transmission is 


Br =}r = wf, = vW [2] 


Fine-grain quantization for accurate reconstruction of the message waveform 
requires q œ> 1, which increases the transmission bandwidth by the factor 
v = logy q times the message bandwidth W. 

Now consider a PCM receiver with the reconstruction system in Fig. 12.1-2a. 
The received signal may be contaminated by noise, but regeneration yields a clean 
and nearly errorless waveform if (S/N), is sufficiently large. The DAC operations 
of serial-to-parallel conversion, M-ary decoding, and sample-and-hold generate the 
analog waveform x,(f) drawn in Fig. 12.1-2b. This waveform is a "staircase" 
approximation of x(t), similar to flat-top sampling except that the sample values 
have been quantized. Lowpass filtering then produces the smoothed output signal 


eu dri a ir ala 


12.1 Pulse-Code Modulation 


















DAC 
PCM v digits 
+ noise Serial to 
parallel M-ary s |LO.| Ler Dol 
(S/N) converter decoder W 
Sync 
(a) 








Figure 12.1-2 {a} PCM receiver; [b] reconstructed waveform. 


yp (t), which differs from the message x(f) to the extent that the quantized samples 
differ from the exact sample values x(kT,). 

Perfect message reconstruction is therefore impossible in PCM, even when 
random noise has no effect. The ADC operation at the transmitter introduces per- 
manent errors that appear at the receiver as quantization noise in the reconstructed 


signal. We'll study this quantization noise after an example of PCM hardware 
implementation. 





Suppose you want to build a binary PCM system with q = 8sov = log, 8 = 3 bits 
per codeword. Figure 12.1—3a lists the 8 quantum levels and two types of binary 
codes. The “natural” code assigns the word 000 to the lowest level and progresses 
upward to 111 in the natural order of binary counting. The sign/magnitude code uses 
the leading bit b, for the algebraic sign of x, while the remaining bits bb; represent 
the magnitude. Other encoding algorithms are possible, and may include additional 
bits for error protection—the topic of Chap. 13. 

A direct-conversion ADC circuit for the sign/magnitude code is shown in 
Fig. 12.1—3b. This circuit consists of one comparator for the sign bit and three par- 
allel comparators plus combinational logic to generate the magnitude bits. Direct- 
conversion ADCs have the advantage of high operating speed and are called flash 


EXAMPLE 12.1-1 
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Figure 12.1-3 {a} Binary PCM codes for q = 8; (b) directconversion ADC circuit for 
sign/magnitude code; (c) weighted-resistor decoder circuit. 


encoders, but they require a total of g/2 comparators. At lower speeds you can get 
by with one comparator and a feedback loop, a configuration found in the dual- 
slope, counter-comparison, and successive-approximation encoders. 

Figure 12.1-3c shows the circuit for a weighted-resistor decoder that goes 
with a 3-bit sign/magnitude code. The sign bit operates a polarity selector switch, 
while the magnitude bits control the resistors to be connected to the reference volt- 


age. The overall circuit acts as an inverting op-amp summer with output voltage 
(—1)5(4b, + 2b, + 1)/8. 


Direct Digital Synthesis of Analog Waveforms 
An alternative to the PLL frequency synthesizer is the Direct Digital Synthesis 


(DDS) as shown in Fig. 12.1-4. Here a waveform is quantized with the samples 
stored in computer memory. The memory contents are repeatedly sent to a DAC 
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Figure 12.1-4 DDS for waveform generation. 


which converts these to an equivalent analog signal. The rate of the output samples 
determines the waveform's frequency. 





EXERCISE 12.1-1 


A binary channel with r, = 36,000 bits/sec is available for PCM voice transmission. 
Find appropriate values of v, q, and f, assuming W = 3.2 kHz. 





Quantization Noise 


Although PCM reconstruction most often takes the form of staircase filtering, as in 
Fig. 12.12, we'll find the impulse reconstruction model in Fig. 12.1-5 more con- 
venient for the analysis of quantization noise. Here, a pulse converter in place of the 
sample-and-hold circuit generates the weighted impulse train 


yat) = M [x(kT;) + e] 8(t — kT;) [3a] 
k 
Where e, represents the quantization error, namely 
epe A (kT,) — x(kT,) [35] 
Lowpass filtering with B = f,/2 yields the final output 


yp(t) = x(t) + S e, sinc (ft — k) [a] 


This expression has the same form as reconstruction of analog pulse modulation 
with noisy samples; see Eq. (2) Sect. 10.6. Furthermore, when q is large enough for 
reasonable signal approximation, the e, will be uncorrelated and independent of x(t). 
Accordingly, we identify e? as the mean-square quantization noise. 
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Figure 12.1-5 Impulse reconstruction model. 
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Round-off quantization with equispaced levels ensures that |e,| = 1/g. Lacking 
additional information to the contrary, we assume that the quantization error has 
zero mean value and a uniform probability density function over —1/q = e, S 1/q. 
Thus, the quantization noise power is 





2 2 1 t 2 l 
j= a=] de= [5] 


g 
= 1/4 24 


which reflects the intuitive observation that the quantization noise decreases when 





the number of quantum levels increases. 4 
Now we measure PCM performance in terms of the destination signal power | 

Sp = x? = S, x 1 and the quantization noise power oe The destination signal-to- 3 
noise ratio then becomes 3 
s) S, E 

>} -— -3qs [6] E 

2 x A 

G D Og a 


4 
i 


A more informative relation for binary PCM is obtained by setting g = 2" and 
expressing (S/N )p in decibels. Thus, 


S 


(=) = 10 logy) (3 X 27*S,) < 4.8 + 60v dB [7] 
D 


where the upper bound holds when S, = 1. Voice telephone PCM systems typically 
have v = 8 so (S/N)p = 52.8 dB. 

But many analog signals—especially voice and music—are characterized by a 
large crest factor, defined as the ratio of peak to rms value, |x(t)|max/@,x- Our signal 
normalization establishes |x(t)|na, = 1, and a large crest factor then implies that 


KEEN EOM 





5, = o} << 1. Consequently, the actual signal-to-noise ratio will be significantly 
less than the theoretical upper bound. For instance, some digital audio recording 
systems take v = 14 to get high-fidelity quality with (S/N)p = 60 dB, compared to 
(S/N )p = 88.8 dB predicted by Eq. (7). As a bandwidth-conserving alternative to 
increasing v, PCM performance may be improved through the use of companding, 
equivalent to nonuniform quantization. 


EXAMPLE 12.1-3 





strengths of the maternal and fetal components are on the order of 1 volt and 100 uY 
respectively. Based on these signal levels, let's assume that the fetal signal has 
$,— 5 x 107°. If we employ a 1-volt bipolar 12-bit ADC, then using Eq. (7) we get 
a (S/N)p = 4.8 + 6 x 12 + 101og (5 X 107?) = —62 dB. On the other hand, if 
we use a 16-bit ADC, we get a (S/N)p = 4.8 + 6 X 16 + 101log(5 x 10?) = 

17.8 dB. Furthermore, if we compare these two cases with respect to voltage levels, 
the 12-bit ADC gives us a peak-to-peak quantization noise of 2/2" = 488 uV 
which exceeds the 100 uV input signal level. However, with the 16-bit ADC, we 
only get a peak-to-peak quantization noise of 31 wV. 
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EXERCISE 12.1-2 


Consider binary PCM transmission of a video signal with f, = 10 MHz. (a) Calcu- 
late the signaling rate needed to get (S/N), = 50 dB when S, = 1. (b) Repeat (a) 
with S, = 0.1. 





Nonuniform Quantizing and Companding* 


A normalized signal x(t) with a large crest factor can be represented by the typical 
probability density function p,(x) sketched in Fig. 12.1-6. The > even symmetry and 
absence of an impulse at x = O correspond tox = 0, so 02 = x? = S, and 


1 1 
S, = | x'py(x) dx = 2 | Xx" px) dx [8] 
-1 0 
This integration yields S, < 1 because the PDF has a dominant peak at x = 0. 
The shape of p,(x) also means that |x(t)| << 1 most of the time. It would there- 
fore make good sense to use nonuniform quantization as indicated by the dashed 
lines. The quantum levels +x), ..., Xx,» are closely spaced near x = 0, but more 
widely spaced for the large values of |x(t)| which occur infrequently. We calculate 
the resulting quantization noise as follows. : 
Consider a sample value x = x(kT,) in the band a; < x < b; around the quan- 
tum level x;. The quantizing error e; = x; — x then has the mean square value 





bi 
e = | (x; — x)y!py(x) dx [9a] 


ai 


Summing €? over all 4 levels gives the quantization noise 


: q/2 
o= 2 > €: [9b] 


" 








Figure 12.1-6 Message PDF with nonuniform quantization bands. 
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where we've taken advantage of the even symmetry. In the usual case of q >> 1, the 
step height A; = b; — a; will be small enough that p(x) = px(x;) over each integra- 


tion band and x; will fall roughly in the middle of the step. Under these conditions 
Eq. (9a) simplifies to 


m x, +h/2 k? 
€ s præ) | (x; — x) dx — PAX) Ty 
x; —h,f2 
and thus 
P 1 22 " 
9 5 X plxd)h [10] 
i=l 


As a check on this expression, we note that if the signal has the uniform PDF 
px(x) = 1/2 and if the steps have equal height h;=2/q, then of = 
(1/6)(4/2)(1/2)(2/4)? = 1/3q? which agrees with our earlier result in Eq. (5). 

Theoretically, you could optimize PCM performance by finding the values of 
X, Ap and b; that result in minimum quantization noise. Such optimization is a diffi- 
cult procedure that requires knowledge of the signal's PDF. Additionally, the custom- 
tailored hardware needed for nonlinear quantizing costs far more than standard 
uniform quantizers. Therefore, the approach taken in practice is to use uniform quan- 
tizing after nonlinear signal compression, the compression characteristics being 
determined from experimental studies with representative signals. 

Figure 12.1—7 plots an illustrative compressor curve z(x) versus x for 0 = x = 1; 
the complete curve must have odd symmetry such that z(x) = —2(|x|) for 
—] =x <0. Uniform quantization of z(x) then corresponds to nonuniform quanti- 
zation of x, as shown in the figure. The nonlinear distortion introduced by the com- 
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Figure 12.1-7 Compressor characteristic curve. 
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pressor is corrected after reconstruction by a complementary expander, identical to 
the companding strategy discussed in Sect. 3.2. Hence, the DOS ECHO signal-to- 
noise ratio for companded FENI is (S/N)5 = S,/o?, with oj given by Eq. (10). 

Our next task is to obtain as in terms of the compressor as For that purpose let 


^ dzx 


so z'(x;) equals the slope of z(x) at x = x; The conditions q => 1 and A; «& 1 jus- 
tify the approximation z'(x;) = (2/q)/h;, so 


a" [ASI oat 


Equation (10) then becomes 


4 3? 2 (! px(x) 
adsa 





P 











a [z ar Bq” 4 [2 GO 
where we've passed from summation to integration via h; — dx. Therefore, our final 
result is 
S ae ese 
(=) at [11a] 
N D Tq K, 
with 
1 
x 
rêl pal) [116] 
o i609] 


which takes account of the compression. If K, < 1 then (S/N)5 > 3478, and com- 
panding improves PCM performance by reducing the quantization noise. 
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p-Law Companding for Voice PCM 


The popular 4-law companding for voice telephone PCM employs the following 
compressor characteristic 


In(1 + pelx|/xmax) 
x). 0253 SS ee ] ^ P b = 12 


where x44, is the maximum input level that can be quantized without overload, and 
in the United States, the standard value for u is 255. The function z(x) then feeds 
the input of an n-bit uniform quantizer to generate x}. To get back an estimate of the 
original voice signal, we apply the quantized sional to an expander which is the 
inverse of Eq. (12) or 


Xmax Xa I ^ 
zs) = “BE (ce aye — 1] sgn Goa) 2 nal 


EXAMPLE 12.1-4 
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If we assume normalized inputs, such that |x| < | then from Eq. (12) we get 


; H 1 
z (x) = ———— ——_ 14 
e) In (1 + u) 1 + plz] i 
The parameter yz is a large number so z'(x) >> lfor|x| << 1 whereasz'(x) << 1 for 
|x| = 1. Substituting z'(x) into Eq. (115) and performing the integration, we obtain 
In? (1 + p x 
K,= ZU tUi + 2u|x| + wS,) [15] 
E 3 
Now we need values for [x] and S, to test the efficacy of -law companding. i 
Laboratory investigations have shown that the PDF of a voice signal can be i 
modeled by a Laplace distribution in the form ; 





px(x) = 2 e ~ell [16a] 
with 
2 — | [S 
= G7? = = = — = T 
$.,7 0; X |x| d [165] 


This distribution cannot be normalized for |x(t)|max = 1, but the probability of 
|x(t)| > 1 will be less than 1 percent if S, < 0.1. Other voice PDF models yield 
about the same relationship between |x| and S,, which is the critical factor for eval- 


uating K,. E 
Taking the standard value w = 255 and putting |x| = VS,/2 in Eq. (15), we 
obtain 


K, = 4.73 X 107*(1 + 361 V/S, + 65,0255,) 


Numerical calculations then show that K, « 1 for S, « 0.03. But more significant 
is the fact that S./K, stays nearly constant over a wide range of S,. Consequently, 
p-law companding for voice PCM provides an essentially fixed value of (S/N )p, 
despite wide variations of S, among individual talkers. Figure 12.1—8 brings out this 
desirable feature by plotting (S/N), in dB versus S, in dB, with and without com- 
panding, when g = 2°. Notice the companding improvement for S, < —20 dB. 





EXERCISE 12.1-3 Companding will reduce the quantization errors. Consider a u = 255 compandor to 


be used with a v = 3 bit quantizer where the output varies over +8.75 V. For an 
input of 0.6 V, what is the e, with and without companding? 








12.2 PCM WITH NOISE 


In this section we account for the effects of random noise in PCM transmission. The 
resulting digital errors produce decoding noise. After defining the error threshold 
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Figure 12.1-8 PCM performance with u-law companding. 


level, we'll be in a position to make a meaningful comparison of PCM with analog 
modulation methods. 


Decoding Noise 


Random noise added to the PCM signal at the receiver causes regeneration errors 
that appear as erroneous digits in the codewords. The decoder then puts out a differ- 
ent quantum level than the one intended for a particular sample. 

Hence, 





The analysis of decoding noise is not too difficult if we restrict our attention to 
binary PCM with uniform quantization and a relatively small bit error probability 
P,. The number of bit errors in a v-digit codeword is a random variable governed by 
the binomial distribution. However, when P, << 1, the probability of one error in a 
given word approximately equals vP, and the probability of two or more errors is 
small enough for us to ignore that event. Of course, the effect of a single error 
depends on where it falls in the word, because the different bit positions have differ- 
ent decoding interpretations. 
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Consider a “natural” binary codeword of the form b, 5, ; '' bibo in which the 
mth bit distinguishes between quantum levels spaced by 2" times the step height 
2/q. (The sign/magnitude code illustrated back in Fig. 12.1—3 follows this pattern for 
the magnitude bits, but the sign bit has a variable level meaning that makes analysis 
more complicated.) An error in the mth bit then shifts the decoded level by the 
amount €,, = +(2/q)2”, and the average of eĉ over the v bit positions equals the 
mean-square decoding error for a random bit-error location. Thus, 


DES (2 | 4 v—] 
= — —2m = ——— 4m ] 
n TA [1] 


m=0 


3 | 





44-1 4¢-1 a4 
vq 3 àv e 3v 

where we've used the formula for summing a geometric progression and substituted 

4” = 2?" = q* >> |. The decoding noise power is therefore 


03 = VP em ™ 3P, [2] 
since erroneous words occur with probability vP,. 


The total destination noise power consists of decoding noise o2 and quantization 
noise o? = 1/3q*, which come from essentially independent processes. Therefore, 





$ EL 
iL I ECT ME 
and 
S 3g? 
( ) see [3] 
N D 1+ 4q*P, 


so the effect of decoding noise depends upon the relative value of the quantity 
Aq? P,. Indeed, we see in Eq. (3) the two extreme conditions 


=4 3 [4] 


s 3q°S, P, « 1/4q* 
(5). Sl. P> ijg 


4P, 


Quantization noise dominates when P, is small, but decoding noise dominates and 
reduces (S/N )5 when P, is large compared to 1/4q°. 

Now recall that the value of P, is determined by the received signal-to-noise 
ratio (S/N Jg at the input to the digital regenerator. Specifically, for polar binary sig- 
naling in gaussian white noise we know that P, = Q[ V(S/N) |. Figure 12.2-1 
plots (S/N )5 versus (S/N) for this case, with S, = 1/2 and two values of q. The 
precipitous decline of (S/N)p as (S/N) decreases constitutes a threshold effect 
caused by increasing errors. Below the error threshold, when P, >> 1/4q’, the 
errors occur so often that the reconstructed waveform bears little resemblance to the 
original signal and the message has been mutilated beyond recognition. 
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Figure 12.2-1 Noise performance of PCM. 


Error Threshold 


The PCM error threshold level is usually defined at the point where decoding noise 
reduces (S/N )p by 1 dB. Unfortunately, this definition does not lend itself to analyt- 
ical investigations. As a more convenient alternative, we'll say that decoding errors 
have negligible effect if P, - 10 7. Then we obtain the corresponding condition on 
(S/N), for polar M-ary signaling using Eq. (20) Sect. 11.2, namely 


(i-e o 
dn m/e M*—1NN/g| 


Solving for the minimum value of (S/N), yields the threshold level 


9 = 6(M* — 1) [5] 


th 





If (S/N )p < 6(M* — 1), the PCM output will be hopelessly mutilated by decoding 
noise. 

A subtle but important implication of Eq. (5) relates to the digital signaling rate 
and transmission bandwidth. We’ll bring out that relationship with the help of the 
analog transmission parameter y = S4/N9W = (Bz/W)(S/N )s. The PCM transmis- 
sion bandwidth is By = r/2 = vW, so 


Ya = (Br/W)(S/N )r, [6] 


B 
~ E OP -1)z6vw(M? — 1) 
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Given v and M, this equation tells you the minimum value of needed for PCM 
operation above threshold. It also facilitates the comparison of PCM with other 
transmission schemes. 





EXAMPLE 12.2-1 Example 12.1—4 showed that a voice PCM system with M = 2, v = 8, and u-law 
companding has (S/N), ~ 37 dB. Equation (6) gives the corresponding threshold 
level yq 2 144 ~ 22 dB. Hence, the PCM system has a potential 15-dB advantage 
over direct analog baseband transmission in which (S/N)5 = y. The full advantage 
would not be realized in practice where allowance must be made for By > vW and 
Y 2 Yw 





EXERCISE 12.2-1 The 1-dB definition for error threshold is equivalent to 10 logio (1 + 4g°P.) = 1. 


o 


Calculate P, from this definition when q = 2°. Then find the corresponding value of 
(S/N )& for polar binary signaling, and compare your result with Eq.(5). 





PCM versus Analog Modulation 


The PCM threshold effect reminds us of analog modulation methods such as FM 
and PPM that have the property of wideband noise reduction above their threshold 
levels. As our initial point of comparison, let's demonstrate that PCM also provides 
wideband noise reduction when operated above threshold so (S/N)p = 39?5,. For 
this purpose, we'll assume that the sampling frequency is close to the Nyquist rate 
and By ~= vW. Then q = M" ~= M” where b = B,/W is the bandwidth ratio. Hence, 


S 
2) ess 7 
by. f : i 


which exhibits noise reduction as an exponential exchange of bandwidth for signal- 
to-noise ratio. Granted that the noise being reduced is quantization noise, but ran- 
dom noise has no effect on PCM above threshold. The exponential factor in Eq. (7) 
is far more dramatic than that of wideband analog modulation, where (S/N)p 
increases proportionally to b or b’. 

For further comparison including threshold limitations, Fig. 12.2-2 illustrates 
the performance of several modulation types as a function of y. All curves are cal- 
culated with S, = 1/2, and the heavy dots indicate the threshold points. The PCM 
curves are based on Eqs. (6) and (7) with M = 2 and v = b. 

Clearly, in the name of power efficiency, PCM should be operated just above 
threshold, since any power increase beyond y = Yp yields no improvement of 
(S/N)5. Near threshold, PCM does offer some advantage over FM or PPM with the 
same value of b and (S/N). And even a 3-dB power advantage, being a factor of 2, 
may spell the difference between success and failure for some applications. But that 
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Figure 12.2-2 Performance comparison of PCM and analog modulation. 


advantage is gained at the price of more complicated and costly hardware. In fact, 
PCM was deemed totally impractical prior to the development of high-speed digital 
electronics in the late 1950s—two decades after the invention of PCM. 

Two other benefits don't appear in Fig. 12.2-2. 





Another benefit comes from the fact that 





Digital multiplexing then makes it possible to combine PCM and digital data signals 
for flexible and efficient utilization of a communication channel. Taken together, 
these two benefits account for PCM's preeminence in the design of new systems for 
long-distance telephony. 

But PCM 1s not suited to all applications. In radio broadcasting, for instance, 
we want a relatively large signal-to-noise ratio, say ($/N)p ~ 60 dB. Figure 12.222 
reveals that this would require binary PCM with b > 8, or FM with a smaller band- 
width ratio b — 6 and much simpler hardware at the transmitter and receivers. 
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Likewise, bandwidth and hardware considerations would reject PCM for most 
single-channel systems. 





EXERCISE 12.2-2 Starting with Eqs. (6) and (7), show that a PCM system operated at the threshold 


point has 
S Yih ) : 
—) =3[14+— 
(5). ? ( 6b Sx n 


Compare this expression with that of WBFM by setting D = b/2 >> 1 in Eq. (20), 
Sect. 10.3. 





12.3 DELTA MODULATION AND PREDICTIVE CODING 


Sample values of analog waveforms derived from physical processes often exhibit 
predictability in the sense that the average change from sample to sample is small. 
Hence, you can make a reasonable guess of the next sample value based on previous 
values. The predicted value has some error, of course, but the range of the error 
should be much less than the peak-to-peak signal range. Predictive coded modula- 
tion schemes exploit this property by transmitting just the prediction errors. An 
identical prediction circuit at the destination combines the incoming errors with its 
own predicted values to reconstruct the waveform. 

Predictive methods work especially well with audio and video signals, and 
much effort has been devoted to prediction strategies for efficient voice and image 
transmission. Delta modulation (DM) employs prediction to simplify hardware in 
exchange for increased signaling rate compared to PCM. Differential pulse-code 
modulation (DPCM) reduces signaling rate but involves more elaborate hardware. 
We'll discuss both DM and DPCM in this section, along with the related and fasci- 
nating topic of speech synthesis using prediction. 


Delta Modulation 


Let an analog message waveform x(t) be lowpass filtered and sampled every T, sec- 
onds. We'll find it convenient here to use discrete-time notation, with the integer 
independent variable k representing the sampling instant t = KT,. We thus write x(k) 
as a shorthand for x(KT,), and so on. 

When the sampling frequency is greater than the Nyquist rate, we expect that 
x(k) roughly equals the previous sample value x(k — 1). Therefore, given the quan- 
tized sample value x,(k — 1), a reasonable guess for the next value would be 


Ek) = xj — 1) Ü 


where X,(k) denotes our prediction of x,(k). A delay line with time delay T, then 
serves as the prediction circuit. The difference between the predicted and actual 
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value can be expressed as 
xk) = E(k) + e) (2 


in which e,(k) is the prediction error. 

If we transmit €,(k), we can use the system in Fig. 12.3-1 to generate x,(k) by 
delaying the current output and adding it to the input. This system implements Eqs. (1) 
and (2), thereby acting as an accumulator. The accumulation effect is brought out by 
writing x(k) = €,(k) + x(k — 1) with x,(k — 1) = e(k — 1) + x(k — 2), and 
so forth; hence 


xalk) = ek) + ek — 1) + x(k ~ 2) 


e (k) + e(k — 1) + ek- 2) + 


i 


An integrator accomplishes the same accumulation when e,(k) takes the form of 
brief rectangular pulses. 

At the transmitting end, prediction errors are generated by the simple delta 
modulation system diagrammed in Fig. 12.3—2. The comparator serves as a binary 
quantizer with output values +A, depending on the difference between the pre- 
dicted value x,(k) and the unquantized sample x(k). Thus, the resulting DM signal is 


elk) = [sgn e(&)]A [3a] 
where 
e(k) = x(k) — X,(k) [3b] 
e) x(k) 
Delay T; 
T =k- 
Figure 12.3-1 Accumulator for delta modulation. 
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Figure 12.3-2 DM transmitter. 
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which represents the unquantized error. An accumulator (or integrator) in a feedback 
loop produces x,(k) from e,(k), similar to Fig. 12.3-1 except that the feedback sig- 
nal comes from the delayed output. 

Observe that this DM transmitter requires no analog-to-digital conversion other 
than the comparator. Also observe that an accumulator like Fig. 12.3-1 performs the 
digital-to-analog conversion at the receiver, reconstructing x,(k) from e,(k). Thus, 


ù EEES 
v dd vta asia 
de pinea t 





The name delta modulation reflects the fact that each input sample x(k) has 
been encoded as a single pulse of height + A or — A. But we can also view e,(X) as 
a binary waveform with signaling rate r, = f, or one bit per sample. For this reason 
DM is sometimes called “one-bit PCM." The corresponding transmission band- 
width requirement is 


B,mry2-f2 [4] 


We get by with just one bit per sample because we're transmitting prediction errors, 
not sample values. Nonetheless, successful operation requires rather high sampling 
rates, as we'll soon see. 

Figure 12.3-3 depicts illustrative continuous-time waveforms x(t), X(t), and 
€,(t) involved in DM. The staircase waveform x,(t) at the receiver differs from X (t) 
only by a time shift of T, seconds. The transmitter starts with an arbitrary initial pre- 
diction such as X,(0) < x(0) so e,(0) = +A. Then e,(0) is fed back through the 
accumulator to form the updated prediction X,(7;) = x,(0) + €,(0). Continual 
updating at each sampling instant causes X,(t) to increase by steps of A until the 
start-up interval ends when X,(kT,) > x(kT,) and e€,(kT,) = —A. If x(t) remains 
constant, X,(t) takes on a hunting behavior. When x(t) varies with time, X,(t) fol- 
lows it in stepwise fashion as long as the rate of change does not exceed the DM 
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Figure 12.3-3 DM wavetorms. 
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tracking capability. The difference between X, (t) and x(f) is called granular noise, 
analogous to quantization noise in PCM. The reconstructed and smoothed wave- 
form at the receiver will be a reasonable approximation for x(t) if A and T, are suffi- 
ciently small. 

But when x(7) increases or decreases too rapidly, X,(t) lags behind and we have 
the phenomenon known as slope overload, a fundamental limitation of DM. Since 
X (t) changes by +A every T, = 1/f, seconds, the maximum DM slope is *f;A and 
a sufficient condition for slope tracking is 


where x(t) = dx/dt. Consider, for instance, the modulating tone x(t) = 
A, cos 27 f,t so x(t) = —25 f, A, sin 2m f, t and |x(t)|max = 27 f, As <= 27W, 
where the upper bound incorporates our message conventions A, - 1 and 
fa € W. Equation (5) therefore calls for a high sampling frequency 
f; 2 20W/A >> 2W,since we want A << 2 to make the steps of X, (t) small com- 
pared to the peak-to-peak signal range —1 = x(t) = 1. 

DM performance quality depends on the granular noise, slope-overload noise, 
and regeneration errors. However, only granular noise has significant effect under 
normal operating conditions, which we assume hereafter. Even so, the analysis of 
granular noise is a difficult problem best tackled by computer simulation for accu- 
rate results or by approximations for rough results. 

We’ll estimate DM performance using the receiver modeled by Fig. 12.3-4a 
with e(t — 7,) = €,(k — 1) at the input to the accumulator. Equations (1) and (35) 
then give the accumulator output as x(k — 1) = X4(k) = x(k) — e(k) or 


X(t) = x(t) — e(t) [6] 
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Figure 12.3-4 (a) DM receiver; (b) granular noise waveform. 
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where e(t) is the granular noise waveform sketched in Fig. 12.3-4b. The shape of 
e(t) and the fact that |e(t)| < A suggest a uniform amplitude distribution with 
€ = AS 
Furthermore, experimental studies confirm that the power spectrum of e(t) is essen- 
tially flat over | f | = 1/T, = f,. Thus, 
GÁf) ~ &/2f, fl Sf 


and lowpass filtering yields 


W 
“=l Gdf)df=> d-—— u 
-W 


which is the average power of the granular noise component of x,(t). 
When granular noise is the only contamination in the filtered output, we obtain 


the signal-to-noise ratio 
S S, 3f, 
eres 
N/p N A^W 


8 


This result is almost identical to the PCM expression (S/N)p = 3q?S, if f, = 2W 
and A = 1/q. But f, and A must satisfy Eq. (5). We therefore need a more general 
relationship for the slope-tracking condition. 

Recall from Eq. (21), Sect. 9.2, that if G,( f) is the power spectrum of x(2), then 
the power spectrum of the derivative x(t) is (27)? G (f). Hence, the mean square 
signal slope can be put in the form 


ROP = | CAFON) df= Qro Was) 9 
where e = Vs. is the signal’s rms value and W,ms is its rms bandwidth, defined by 
X 1 oo 1/2 
Was = H | FG) af [10] 
Now we introduce the so-called slope loading factor 
a fA 
= —— 1 
"^ 2mo Weng m 


which is the ratio of the maximum DM slope to the rms signal slope. 
A reasonably large value of s ensures negligible slope overload. Hence, we'll 
incorporate this factor explicitly in Eq. (8) by writing A in terms of s from Eq. (11). 


Thus, 
(=) "T Bhan ASI 
N/p 4m^sW2.W 
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6 ( W J p? 
x EC [12] 
qu A MU 
E where 
[ b 2 fW 13] 


The parameter b equals our usual bandwidth ratio B;/W when B, has the minimum 
3 value f,/2. Equation (12) brings out the fact that DM performance falls between 
| " PCM and PPM since the wideband noise reduction goes as P? rather than P? or expo- 
nentially. DM generally requires a larger transmission bandwidth than PCM to 
achieve the same signal-to-noise ratio, so its applications are limited to those cases 

| where ease of implementation takes precedence over bandwidth considerations. 
í Computer simulations by Abate (1967) indicate that Eq. (12) holds for 
- ]n2b x s « 8. If s < 1n2b, then slope-overload noise dominates and (S/N )p 
| l drops off quite rapidly. Figure 12.3-5 illustrates how (S/N )p varies with s. For a 
specified bandwidth ratio, DM performance is maximized by taking the empirically 

determined optimum slope loading factor 


| Sop ~= In 2b [14] 


The maximum value of (S/N )p is then given by Eq. (12) with s = Sop. 
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Figure 12.3-5 DM performance versus slope loading factor. 
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EXAMPLE 12.3-1 DM Voice Transmission 





When a typical voice signal has been prefiltered so that W ~ 4 kHz, its rms band- E 
width will be Wms = 1.3 KHz. Substituting these values and s = s,, = ln 2b in i 
Eq. (12) yields the maximum DM signal-to-noise ratio (S/N), ~ 5.8b?/(In 2b)?. 
If b = 16, then (S/N), = 33 dB which is comparable to binary PCM with v = 7 
and u-law companding. But the DM signal requires r, = f, = 2bW = 128 kbps 
and Br = 64 kHz, whereas the PCM signal would have r, = vf, = 56 kbps and 
Br 28 kHz. 





EXERCISE 12.3-1 








Consider a signal with a uniform power spectrum G,(f) = (S,/2W)11(f/2W). 
Show that Wms = W/ V3. Then calculate the optimum value of A in terms of S, 
when b = 16. 








Delta-Sigma Modulation 


The preceding DM scheme takes the derivative of the input. In cases where the data 
are noisy, the noise can cause cumulative errors in the demodulated signal. There is 
further difficulty if the signal has a significant dc component. An alternative to the 
DM is a Delta-Sigma Modulator (DSM), also called sigma-delta modulation. 

We first consider an equivalent version of the conventional delta modulator 
transmitter/receiver system as shown in Fig. 12.3-6a. Now let's add an integrator (or 
accumulator) to the input which has the effect of preemphasizing the low frequencies. 
This can also integrate the signal in the feedback path and thereby eliminate the feed- 
back integrator. To compensate for the additional integrator, we then add a differen- 
tiator at the receiver. However, since the DM receiver already has an integrator, which 
is the inverse of a differentiator, we can eliminate both of these at the receiver and 
thus simplify its design, giving us the system shown in Fig. 12.3—6b. 


Adaptive Delta Modulation 


Adaptive delta modulation (ADM) involves additional hardware designed to pro- 
vide variable step size, thereby reducing slope-overload effects without increasing 
the granular noise. A reexamination of Fig. 12.3-3 reveals that slope overload 
appears in €,(t) as a sequence of pulses having the same polarity, whereas the polar- 
ity tends to alternate when x,(t) tracks x(t). This sequence information can be uti- 
lized to adapt the step size in accordance with the signal's characteristics. 

Figure 12.3—7 portrays the action of an ADM transmitter in which the step size 
in the feedback loop is adjusted by a variable gain g(k) such that 


gk) = Z(k — 1) + ek - De — 1) 


The step-size controller carries out the adjustment algorithm 












Integrator LPF yp) 
(accumulator) 





Integrator 
(accumulator) 





| E Transmitter © Receiver (message recovery) 


(a) 






yptt) 






Integrator 


LPF 
(accumulator) 


Transmitter Receiver (message recovery) 


| , 


Figure 12.3-6 {a] DM system; (6) DSM system. 
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| Figure 12.3-7 {a} Adaptive DM transmitter; (b) waveforms. 
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—fek-1)XxK  e(k)-e(k-1) 
Blk) = b. - 1yK E(k) * e(k — 1) 


where K is a constant taken to be in the range 1 < K < 2. Thus, the effective step 
size increases by successive powers of K during slope-overload conditions, as signi- 
fied by €,(k) = €,(k — 1), but decreases when e(k) # €,(k — 1). Another adaptive 
scheme called continuously variable slope delta modulation (CVSDM) provides a 
continuous range of step-size adjustment instead of a set of discrete values. 

The signal-to-noise ratio of ADM is typically 8—14 dB better than ordinary DM. 
Furthermore, the variable step size yields a wider dynamic range for changing val- 
ues of $,, similar to the effect of u-law companding in PCM. As a net result, ADM 
voice transmission gets by with a bandwidth ratio of b = 6-8 or By = 24—32 kHz. 





o Ea ac laud a ai nate 


Differential PCM 


Differential pulse-code modulation (DPCM) combines prediction with multilevel 
quantizing and coding. The transmitter diagrammed in Fig. 12.3-8a has a g-level 
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Figure 12.3-8 Differential PCM. (a) Transmitter; (b) prediction circuit; (c] receiver. 
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quantizer with quantum levels at XA, €34,..., *(g — 1)A. The unquantized 
error x(k) — X,(k) is applied to the quantizer to produce the prediction error e,(&) 
which is then encoded as a binary word with v — log; q bits, just like binary PCM. 
DPCM transmission therefore requires 


ry = vf Br = uf,/2 


However, since x,(k) changes as much as +(q — 1)A from sample to sample, the 
slope tracking condition becomes 


Fla zi 1)A = LECE) fmax 
If g >> 1 and Wms << W, the sampling frequency can be nearly as low as the 
Nyquist rate. 
Multilevel quantization of the prediction error obviously provides better infor- 
mation for message reconstruction at the receiver, To gain full advantage of this 


potential, the DPCM prediction circuit usually takes the form of a transversal filter 
shown in Fig. 12.3-8b, where 


Xk) = > ng exp 


so the predictor draws upon the previous n samples. The tap gains c; are chosen 
to minimize the mean square value of the error x(k) — x,(k). The receiver in 
Fig. 12.3-8c includes an identical prediction filter after the decoder. 


Assuming q >> 1 and. no slope overload, DPCM performs essentially like 
PCM enhanced by a prediction gain G, such that 


(=) = G,3q’S [15] 
N A P q >x 


The gain of an optimum predictor is given by 
-1 


= E - $ cip ` [6d] 
i=] 


where p; = R,{iT,)/S, is the normalized signal correlation and the tap gains satisfy 
the matrix relationship 


Po Pi i Pn-1 || Cy Pi 

was E C 
J Po e 2 = = a (16b] 
Pn-1 Dn-3 Tes Po C Pn 


Jayant and Noll (1984) outline the derivation of Eqs. (15) and (16) and present 
experimental data showing that G, ^ 5-10 dB for voice signals. The higher correla- 
tion of a TV video signal results in prediction gains of about 12 dB. 
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In contrast to delta modulation, DPCM employs more elaborate hardware than 
PCM for the purpose of improving performance quality or reducing the signaling rate 
and transmission bandwidth. Adaptive DPCM (ADPCM) achieves even greater 
improvement by adapting the quantizer or predictor or both to the signal characteristics. 
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EXERCISE 12.3-2 Suppose a DPCM predictor yields G, = 6 dB. Show that the DPCM word needs one 
less bit than that of binary PCM, all other factors being equal. Hint: See Eq. (7), 
Sect. 12.1. 





LPC Speech Synthesis 


Linear predictive coding (LPC) is a novel approach to digital representation of 
analog signals. The method uses a transversal filter (or its digital-circuit equivalent) 
plus some auxiliary components to synthesize the waveform in question. The param- 
eters of the waveform synthesizer are then encoded for transmission, instead of the 
actual signal. Considerable efficiency results if the synthesizer accurately mimics 
the analog process. Since there already exists extensive knowledge about speech 
processes, LPC is particularly well suited to speech synthesis and transmission. 

Figure 12.3—9 diagrams a speech synthesizer consisting of two input genera- 
tors, a variable-gain amplifier, and a transversal filter in a feedback loop. The ampli- 
fier gain and filter tap gains are adjusted to model the acoustical properties of the 
vocal tract. Unvoiced speech (such as hissing sound) is produced by connecting the 
white-noise generator. Voiced speech is produced by connecting the impulse-train 
generator set at an appropriate pitch frequency. 

If the filter has about 10 tap gains, and all parameter values are updated every 
10 to 25 ms, the synthesized speech is quite intelligible, although it may sound 
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Figure 12.3-9 Speech synthesizer. 
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rather artificial, like a robot. Some talking toys and recorded-message systems gen- 
erate speech sounds by the synthesis method, drawing upon parameter values stored 
in a digital memory. Systems that generate speech in this manner are called 
vocoders. An alternative to vocoders are waveform encoders where a given algo- 
rithm such as PCM is employed to replicate a given speech signal. 

Now consider the LPC transmitter of Fig. 12.3-10a. Sample values of voice 
input are analyzed to determine the parameters for the synthesizer, whose output is 
compared with the input. The resulting error 1s encoded along with the parameter 
values to form the transmitted digital signal. The receiver in Fig. 12.3-10b uses the 
parameter values and quantized error to reconstruct the voice waveform. 

A complete LPC codeword consists of 80 bits—1 bit for the voiced/unvoiced 
Switch, 6 bits for the pitch frequency, 5 for the amplifier gain, 6 for each of the 10 
tap gains, and a few bits for the error. Updating the parameters every 10-25 ms is 
equivalent to sampling at 40-100 Hz, so LPC requires a very modest bit rate in the 
vicinity of 3000 to 8000 bps. Table 12.3-1 compares LPC with other voice encoding 
methods. The substantial bit-rate reduction made possible by LPC has stimulated 
efforts to improve the quality of speech synthesis for voice communication. In fact, 
as shown in Table 12.3-1, using LPC we can compress speech from a 56-kbps to 3- 
kbps rate. See Rabiner and Schafer (1978) for a general introduction to digital pro- 
cessing applied to speech signals. 
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Figure 12.3-10 LPC transmission system. (a) Transmitter; {b} receiver. 
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Table 12.3-1 Comparison of voice encoding methods 

Encoding Sampling Bits per Bit rate, 
method rate, kHz sample kbps 
DM 64-128 1 64-128 
PCM 8 7-8 56-64 
ADM 48-64 1 48-64 
DPCM 8 4-6 32-48 
ADPCM 8 3-4 24-32 
LPC 0.04—0.1 = 80 3-8 








12.4 DIGITAL AUDIO RECORDING 


Much of our communication infrastructure deals with the transmission of signals 
whose content is music, so we are interested in preserving fidelity, dynamic range, 
and so on. The study of digital audio recording provides us with a good lesson in the 
advantages of digital over analog systems and some methods we can employ to 
maintain signal quality. 

Analog audio storage media such as magnetic tape have several quality limita- 
tions. The media itself wears out with constant use or degenerates over time: tapes 
stretch out and exhibit wow and flutter. Their dynamic range is usually limited to 
70 dB, whereas a live orchestra can have a dynamic range of 100 to 120 dB. There- 
fore, to record the music and accommodate the wide range of amplitudes, the 
recording level has to be adjusted so that soft music isn’t lost, and loud music 
doesn’t saturate the amplifier. 

The recent development of compact disk (CD) technology for digital recording 
of audio signals is a significant advancement in the art of music recording. A CD 
consists of a plastic disk about 120 mm in diameter. It has about 20,000 tracks, each 
having a width of 0.5 um and. spaced 1.6 jum apart. Each track has a series of micro- 
scopic pits that stick out from the plastic. The regions between pits are called lands. 
The presence or absence of a pit is detected and converted to a binary electrical sig- 
nal by the laser diode and photodiode system of Fig. 12.4—1. The laser light reflects 
off the pit and is transmitted into the photodiode via the beam splitter. Because the 
laser beam is focused at the specific depth, fingerprints, dust, scratches and other 
surface defects are blurred and therefore usually not detected by the system. Unlike 
magnetic media, there 1s no mechanical contact with the surface and thus no wear on 
the surface. Figure 12.4—2 shows the equivalent electrical signal of the programmed 
tracks. Note that a binary 1 occurs when there is a transition from pit to land or vice 
versa. The CD has a total track length of 5300 m and a scan velocity of 1.2 m/s, thus 
giving a total playing time of 74 minutes. 
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| Figure 12.4-1 Optical readout system for a CD. 
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| Figure 12.4-2 CD NRZI signal corresponding to programmed pits. 
| CD Recording 


The system for recording left and right channel audio information to a CD is shown in 
Fig. 12.43. With high-fidelity music having a bandwidth of 20 kHz, the signal is sam- 
pled at 44.1 kHz to meet the Nyquist criterion and provide a guard band to minimize 
ahasing. The system has an analog antialiasing LPF which may also incorporate pre- 
emphasis. If preemphasis is included, a control word is also recorded to activate deem- 
| phasis during playback. Prior to PCM, a small amount of dither, or white noise, is 
added to mask any granular noise. The signal is then digitized using PCM with 16-bit 
uniform quantization. For an ADC, the dynamic range is defined by 20 log (2") dB, and 
thus a 16-bit system has a dynamic range of 20 log (215) = 96 dB. With a sample rate of 
44.1 kHz and 16 bits of quantization, the PCM has an output bit rate of 705.6 kpbs. 
The two PCM outputs are sent to the Cross Interleave Reed-Solomon Error 
| Control Code (CIRC) stage giving an effective bit rate of 1.4112 Mbps. The purpose 
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Figure 12.4-3 CD audio recording system. 


of the CIRC is to encode the data with parity or check symbols to enable error cor- 
rection during the playback process. Reed-Solomon codes are used for this purpose. 
Encoding occurs as follows: (a) Each 16-bit PCM signal is split into two 8-bit sym- 
bols. (b) Twelve symbols from each channel (i.e., twenty-four 8-bit symbols) are 
scrambled and then applied to the first CIRC encoder. (c) This encoder inserts four 
8-bit parity symbols into the 24-symbol sequence to generate a 28-symbol sequence. 
These extra symbols are called Q words and enable error correction. (d) The symbol 
sequence is then dispersed according to a specified system of interleaving. Inter- 
leaving symbols minimizes the effects of burst-type errors where noise and media 
defects cause successive symbols to be corrupted. Interleaving also better enables 
the Reed-Solomon codes to correct errors. (e) The interleaved words undergo 
another encoding process in which an additional four 8-bit parity symbols called P 
words are added to the 28-symbol sequence. The end result is that the CIRC module 
converts a 24-symbol sequence into a 32-symbol sequence. | 

The control-word module adds an 8-bit symbol to each 32-symbol data block 
from the CIRC. This extra word is used to provide information about track separa- 
tion, end of disk flags, track number, and so forth. 

The eight-to-fourteen (EFM) module takes each 8-bit symbol from the control 
word module and converts it to a 14-bit symbol. Adding these additional bits 
reduces the signal bandwidth and dc content, and adds additional synchronization 
information, the purpose being to improve the recording and playback process and 
to allow for easier manufacturing tolerances for the lands/pits. Recall that an 8-bit 
code can represent 256 possible bit combinations, and a 14-bit code represents 
16,384 combinations. Thus with EFM encoding, we chose 256 out of the 16,384 
possibilities to represent the data from the control word module. To further reduce 
the signal's dc content, the EFM stage adds three additional bits we call merging bits 
to each 14-bit word. That is, each binary 1 1s separated by at least two but not more 
than ten Os, and the signal has a length that varies between three and eleven clock 
periods. The bit stream is then converted into a nonreturn-to-zero inverse (NRZI) 
format as shown in Fig. 12.4-2. Thus we have added an additional 9 bits to the 
incoming 8-bit symbol. 
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The data is then organized into blocks called frames, each one consisting of 
thirty-three 17-bit symbols (i.e., 561 bits). In order to indicate the start of each 
frame, and to specify the player's motor frequency, an additional 17-bit sync word 
and three merging bits are added, resulting in a frame length of 588 bits. The data 
output r, is then written into the CD. Therefore, starting with the 1.4112 Mbs data 
rate at the input of the CIRC, the output rate 1s calculated as follows: 


32 33 17 
r, = 1.4112 Mbps X (2) X (3)... 5 (2) 
24 CIRC 32 word 8 EFM 


coding 


x (=) 4.3218 Mb 1/T, 
sync word T ^- S= 
561 "cR P : 
bits 


Given that a 1 is separated from two to ten Os, a bit rate of 4.3218 Mbps, and a 


scan velocity of 1.2 m/s means the CD land/pit size varies from 0.833 wm (3T,) to 
3.054 um (117,). 





EXERCISE 12.4-1 


For a CD, calculate the number of music information bits in each frame and the 
frame rate. 








CD Playback 


The CD playback process is shown in Fig. 12.4—4. The decoder decodes the EFM 
signal, removes the merging bits, and extracts sync and control word information. 
The signal then goes to the error detector. 

CD technology does a superior job of error control. CDs contain two types of 
errors. Random errors are caused by air bubbles or pit inaccuracies in the CD mate- 
rial, and burst errors are caused by scratches, fingerprints, and so forth. When errors 
do occur, they manifest themselves by "click" sounds at playback. Error control is 
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Figure 12.4-4 CD audio playback. 
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done by correction or concealment in the following order. First we employ check 
digits to correct errors that occur. If a specific bitis incorrect, we simply change it to 
the opposite state. On the other hand, if the bit was received with ambiguity, and 
there is enough redundancy in the code, we can use the redundancy to determine the 
correct data value. Otherwise we conceal the incorrect value by interpolating 
between neighboring samples. If that isn't satisfactory, the remaining choice 1s to 
mute the unreliable data value. As long as muting does not exceed a few millisec- 
onds, it is not noticed by the listener. 

At this point, we would expect to use a 16-bit DAC to reconstruct the signal. 
However, 16-bit DACs are relatively expensive, and a more economical and clever 
alternative exists. We first up-sample the data from the error detector by inserting 
N — 1 all zero words between the PCM samples, thus effectively increasing the 
sampling rate to f; = Nf. A digital LPF interpolates between the original samples. 
We usually use N = 256, so the sample rate increases from f, = 44.1 kHz to 
f, = 11.3 MHz. This massive increase in sampling rate causes the amplitude differ- 
ences between successive samples to be relatively small and therefore allows us to 
use a delta-sigma modulator as a 1-bit DAC. A Butterworth analog LPF with a band- 
width of 20 kHz completes the reconstruction process. See Baert et al. (1998) for 
more information on audio recording. 


12.5 DIGITAL MULTIPLEXING 


Analog signal multiplexing was previously discussed under the heading frequency- 
division and time-division multiplexing. While those same techniques could be 
applied to waveforms representing digital signals, we gain greater efficiency and 
flexibility by taking advantage of the inherent nature of a digital signal as a 
sequence of symbols. 





Itis similar to time-division multiplexing but free from the rigid constraints of peri- 
odic sampling and waveform preservation. 

More profoundly, however, the digital revolution has also eliminated the dis- 
tinctions between the various telecommunications services such as telephone, tele- 
vision, and the Internet. Instead, voice, video, graphic, or text information is 
encoded, or digitized, and simply becomes data to be multiplexed with other data 
and then transmitted over an available channel. It no longer matters whether or not 
the data originated from a voice or picture; it's all just data. 

In this section we want to first consider the general concepts and problems of 
digital multiplexing. The signals to be multiplexed may have come from digital data 
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sources or analog sources that have been digitally encoded. We'll consider specific 
cases including telephone system hierarchies, the Integrated Services Digital Net- 
work (ISDN), and the Synchronous Optical NETwork (SONET). We will also 
briefly consider data multiplexing schemes such as packet switching, frame relay, 
and asynchronous transfer mode (ATM). 


Multiplexers and Hierarchies 


A binary multiplexer (MUX) merges input bits from different sources into one sig- 
nal for transmission via a digital communication system. In other words, a MUX 
divides the capacity of the system between several pairs of input and output termi- 
nals. The multiplexed signal consists of source digits interleaved bit-by-bit or in 
clusters of bits (words or characters). 

Successful demultiplexing at the destination requires a carefully constructed 
multiplexed signal with a constant bit rate. Towards this end, a MUX usually must 
perform the four functional operations: 


1. Establish a frame as the smallest time interval containing at least one bit from 
every input. 


2. Assign to each input a number of unique bit slots within a frame. 
3. Insert control bits for frame identification and synchronization. 
4. Make allowance for any variations of the input bit rates. 


Bit rate variation poses the most vexing design problem in practice, and leads to 
three broad categories of multiplexers. 


Synchronous multiplexers are used when a master clock governs all sources, 
thereby eliminating bit-rate variations. Synchronous multiplexing systems attain the 
highest throughput efficiency, but they require elaborate provision for distributing 
the master-clock signal. 


Asynchronous multiplexers are used for digital data sources that operate in a start/stop 
mode, producing bursts of characters with variable spacing between bursts. Buffering 
and character interleaving make it possible to merge these sources into a synchronous 
multiplexed bit stream, as discussed later in conjunction with computer networks. 


Quasi-synchronous multiplexers are used when the input bit rates have the same 
nominal value but vary within specified bounds. These multiplexers, arranged in a 
hierarchy of increasing bit rates, constitute the building blocks of interconnected 
digital telecommunication systems. 


Two slightly different multiplexing patterns have been adopted for digital 
telecommunication: the AT&T hierarchy in North America and Japan and the CCIT 
hierarchy in Europe. (CCIT stands for International Telegraph and Telephone Con- 
sultive Committee of the International Telecommunications Union.) Both hierar- 
chies are based on a 64-kbps voice PCM unit, and have the same structural layout 
shown in Fig. 12.5-1. The third level is intended only for multiplexing purposes, 
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Figure 12.5-1 Multiplexing hierarchy for digital telecommunications. 


whereas the other three levels are designed for point-to-point transmission as well 
as multiplexing. The parameters of the AT&T and CCIT hierarchies are listed in 
Table 12.5-1. 

Observe in all cases that the output bit rate at a given level exceeds the sum of 
the input bit rates. This surplus allows for control bits and additional stuff bits 
needed to yield a steady output rate. Consequently, when we include the PCM band- 
width expansion, a digital network has very low bandwidth efficiency if devoted 
entirely to voice transmission. For instance, the fourth level of the AT&T multiplex- 
ing scheme requires B = r,/2 ~ 137 MHz to transmit 24 X 4 X 7 X 7 = 4032 
voice PCM signals, so the bandwidth efficiency is (4032 X 4kHz)/137 MHz 
= 12%. The old AT&T FDM hierarchy had a bandwidth efficiency of approxi- 
mately 85 percent. Thus digital multiplexing sacrifices analog bandwidth efficiency 
in exchange for the advantages of digital transmission. 

Previously noted advantages of digital transmission include hardware cost reduc- 
tion made possible by digital integrated circuits and power cost reduction made possi- 
ble by regenerative repeaters. Now we can begin to appreciate the flexibility made 





Table 12.5-1 Multiplexing hierarchies 














AT&T CCIT 
Number Output rate, Number Output rate, 
of inputs Mbps of inputs Mbps 
First Level 24 1.544 30 2.048 
Second Level 4 6.312 4 8.448 
Third Level 7 44.736 4 34.368 
Fourth Level 6 274.176 4 139.264 
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Figure 12.5-2 Illustrative configuration of the AT&T hierarchy. 


possible by digital multiplexing, since the input bit streams at any level in Fig. 12.5—1 
can be any desired mix of digital data and digitally encoded analog signals. 

By way of example, Fig. 12.5~2 shows an illustrative configuration of the 
AT&T hierarchy with voice, digital data, visual telephone, and color TV signals for 
transmission on the fourth-level T4 line. Each of the twenty-four 64-kbps encoded 
voice signals is referred to as a digital signal level zero (DSO). The first-level T1 sig- 
nals include PCM voice and multiplexed digital data. The T1 line is often referred to 
as a digital signal level one (DS1), the T2 line as DS2, and so on. The second-level T2 
signals are multiplexed T1 signals along with visual telephone signals encoded as 
binary differential PCM (DPCM) with f, ^ 2 MHz and v = 2 bits per word. PCM 
encoding of color TV requires a 90-Mbps bit rate (f, ~ 10 MHz, v = 9), so two 
third-level T3 lines are allocated to this signal. The higher-level multiplexers labeled 
M12, M23, and M34 belong to the quasi-synchronous class. Let's consider the first- 
level synchronous multiplexer called the channel bank. 





T1 Voice PCM Channel Bank 


Synchronous multiplexing of voice PCM requires that the signals be delivered in 
analog form to the channel bank. Then, as diagrammed in Fig. 12.5—3a, sequential 
sampling under the control of a local clock generates an analog TDM PAM signal 
(see Figs. 7.2-7 and 7.2-8). This signal is transformed by the encoder into TDM 
PCM with interleaved words. Finally, the processor appends framing and signaling 
information to produce the output T1 signal. 

The T1 frame structure is represented by Fig. 12.5—3b. Each frame contains one 
8-bit word from each of the 24 input channels plus one bit for framing, giving a total 
of 193 bits. The 8-kHz sampling frequency corresponds to 125-us frame duration, 
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Figure 12.5-3 (a) T1 channel bank; (b) frame structure. 


so the T1 bit rate is r, = 193 bits + 125 us = 1.544 Mbps. Signaling information 
(dial pulses, "busy" signals, etc.) is incorporated by a method aptly known as bit 
robbing. Every sixth frame, a signaling bit replaces the least-significant bit of each 
channel word—denoted by the starred bit locations in the figure. Bit-robbing 
reduces the effective voice-PCM word length to v = 72 and has inconsequential 
effect on reproduction quality. Yet it allows 24 signaling bits every 6 X 125 us, or 
an equivalent signaling rate of 32 kbps. 

T1 signals may be either combined at an M12 multiplexer or transmitted 
directly over short-haul links for local service up to 80 km. The T1 transmission line 
is a twisted-pair cable with regenerative repeaters every 2 km. A bipolar signal for- 
mat eliminates the problems of absolute polarity and dc transmission. 





EXERCISE 12.5-1 


Assume that the first-level multiplexer in the CCIT hierarchy is a synchronous 
voice-PCM channel bank with 30 input signals, output bit rate r, — 2.048 Mbps, 
and no bit-robbing. Find the number of framing plus signaling bits per frame. 





Digital Subscriber Lines 


The plain old telephone or POT is no longer used for just analog voice signals. It 
has become the channel for computer, fax, video and other digital data. With stan- 
dard voice-only telephone lines, the *'last-mile"' connection between the telephone 
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customer and the central office (CO) consists of a twisted-wire pair that feeds to a 
POT line card located at the CO. The POT card interfaces the voice line to a DSO 
signal connected to the rest of the telephone network. While the twisted pair of 
wires may be capable of carrying signals of up to 30 MHz, the POT card is 
designed for voice signals, and thus the bandwidth is limited to 3.2 kHz. If the 
computer interface is made via a modem (modulator/demodulator), the data rate is 
limited to 30 kbps or, in some cases, up to 56 kbps. This is not acceptable for 
video, interactive video, and other high-speed services. Instead, we want to con- 
sider a digital subscriber line (DSL) that can handle increased data rates. The 
term DSL is somewhat of a misnomer since DSL is primarily a set of standards 
that defines the CO interface, and may or may not affect the existing twisted pair 
cable arrangement. l 
There are several DSL standards; some of these include: 


1. Asymmetric Digital Subscriber Line (ADSL). This is an FDM system 
where the existing twisted pair cable supports three services: (a) POTs, (b) 640 
kbps digital data from subscriber to CO (upstream), and (c) 6.144 Mbps digital 
data from CO to subscriber (downstream). The ADSL system is shown in 
Fig. 12.5-4 with its corresponding spectrum shown in Fig. 12.5-5. 
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Figure 12.5-4 ADSL telephone twisted-wire pair interface. 
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Figure 12.5-5 Spectrum of an ADSL signal. 
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2. High Bit Rate Digital Subscriber Line (HDSL). Consists of one to three 
twisted-wire pairs to transmit and receive digital data from 1.544 to 2.048 Mbps. 


3. Symmetrical Digital Subscriber Line (SDSL). This is a single twisted pair 
version of HDSL and carries 0.192 to 2.23 Mbps digital data in both directions. 


4. Very High Bit Rate Digital Subscriber Line (VDSL). A single twisted pair 
line that can carry as much as 26 or 52 Mbps in either direction. 


5. Integrated Services Digital Network (ISDN). This may be a single twisted 
cable pair that allows for a data rate of at least 128 kbps and enables the inte- 
gration of voice, video, and other data sources. The specifics of ISDN will be 
considered in the next section. - 


See Dutta-Roy (2000) for more information on DSLs. 


Integrated Services Digital Network 


Integrated Services Digital Network (ISDN) is a TDM digital telephone network 
that integrates voice, video, computer, and other data sources. There are two ISDN 
structures: Basic Rate Interface (BRI) and Primary Rate Interface (PRI). The 
BRI consists of two B (bearer) 64-kbps channels and a 16-kbps D (delta) channel. 
It is also referred to as 2B + D. BRI is used primarily by residential and small busi- 
ness customers. In North America, Japan, and Korea, PRI has twenty-three 64-kbps 
B channels and one 64-kbps D channel, or 23B + D. The B and D channels are full 
duplex (i.e., both directions) and carry voice, computer data, and so on, while the D 
channel is generally used to transmit control, signaling, telemetry, or other connec- 
tion information. With BRI, the two B channels could carry two 64-kbps PCM voice 
messages or one 64-kbps voice message and up to six multiplexed 9600 bps data 
streams. Because of faster data rates and increased data channels, ISDN is an alter- 
native to the modem (modulator/demodulator) for interfacing home computers to 
the telephone lines. 

With BRI, the two B 64-kbps channels, the 16-kbps D channel, and some addi- 
tional overhead bits are multiplexed together for a 192-kbps output. Specifically, 16 
bits from each B channel and 4 bits from the D channel are combined with an addi- 
tional 12 bits of overhead to make a 48-bit frame. Each frame is transmitted with a 
period of 250 us to get an output bit stream rate of 192 kbps. PRI has a frame consist- 
ing of 8 bits from each of the 24 channels plus a framing bit used for synchronization 
and control yielding 193 bits (1.e., 24 X 8 + 1 = 193). The frame rate is 8000 frames 
per second yielding an output bit stream rate of 1.544 Mbps. The European and rest of 
the world standard for PRI consists of 30 B + 1 D 64-kbps channels where each 
frame is 256 bits long. Thus, with overhead, the output rate is 2.048 Mbps. 

The ISDN also has high-capacity H channels for information in excess of 64 kbps. 
These include video, high-resolution graphics, high-fidelity audio, HDTV, and so on. 
The channels and their respective capacities are shown in Table 12.5-2. Note that 
Channels HO and H11 are North American IDSNs, whereas H12 is used in Europe. 
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Table 12.5-2 ISDN H channel capacities 





Channel Capacity, kbps 
HO 384 
H11 1,536 
H12 1,920 
H21 32,768 
H22 44,160 
H4 135,168 


So far we have discussed what is now referred to as narrowband ISDN 
(N-ISDN). However ISDN specifications have been increased to handle data rates 
of 2.5 Gbps and beyond, creating what we now call broadband-ISDN (BISDN). 
BISDN enables such services as interactive video, HDTV, other multimedia, and of 
course allows the system to carry more services. 


Synchronous Optical Network 


Synchronous Optical NETwork (SONET) is a fiber-optic standard that enables 
multiple broadband signals and even relatively low-rate signals to be multiplexed 
onto a fiber-optic channel. The standard was developed by BellCore (Bell Commu- 
nications Research) and standardized by ANSI (American National Standards Insti- 
tute). The European equivalent is referred to as Synchronous Digital Hierarchy 
(SDH). A complete tutorial on SONET is available on the World Wide Web at loca- 
tion Attp://www.webproforum.com/sonet. We will present a brief description of its 
operation. 

Figure 12.5—6 shows a block diagram of a SONET system, whereas the details 
of the path-terminating element (PTE) and its associated multiplexing are shown in 
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Figure 12.5-6 SONET system. 
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Fig. 12.5—7. SONET communication works as follows. A communication's link, 
which we will refer to as a path, is established between the left- and right-most PTE. 
The PTE multiplexes various input electrical signals to generate a single optical sig- 
nal and vice versa. First the PTE multiplexes and formats the input data to generate a 
single 51,840 Mbps synchronous transport signal (STS-1) at the output. It also 
demultiplexes the STS signal at the destination. The inputs can be a single broadband 
signal such as an HDTV or multiple low-rate signals that are mapped into a virtual 
tributary (VT) which, in turn, is mapped to the STS- 1 signal. The STS-1 output is an 
electrical signal with a frame structure such that each frame has 90 bytes X 
8 bits/byte X 9 rows/frame X 8000 frames/sec and yields an STS-1 signal rate of 
51.840 Mbps. In the case of SONETS, bytes are often referred to as octets. 

The N STS-1 signals are then byte multiplexed to generate an STS-N signal. It 
is scrambled and fed to an electrical-to-optical converter where it is converted to an 
equivalent optical OC-N signal at some specified wavelength. The signal then trav- 
els over a fiber-optic channel to its destination. It is possible to multiplex additional 
optical wavelengths over the channel. The optical channel may contain regeneration 
to overcome path losses. The channel may also contain add/drop multiplexers 
(ADM) that enable additional STS signals to be added or dropped from the channel. 
For example, when a signal arrives at the ADM, the header information is examined 
to determine if a section of data is to proceed to the same PTE as the rest of the STS 
data or is to be routed to another PTE. The input/output lines of the ADM are OC- 
Ns, but some ADMs may have electrical input/outputs as well. A section defines a 





E 
d 
A 
E 

















































DSi 
28 1.544 | MUX & STS-1 
DSi Mbps frame 
Signals adapter | 7184 Mbps 
1.544 
Mbps 
MUX & 
PS3 || frame 
ae adapter B 
bps yte 
interleaved Scrambler sees 
STS-1 MUX em 
Video | MUX& eres 
— frame [ STSl | 
135 Mbps | adapter. === 
MPEG-2 (NTSC-TV) 
15 Mbps 
PS _DS2_|| mux & STS-1 
6.3 Mbps | frame —— 
DS2 .' adapter 
ps2 
be > ai 





Figure 12.5-7 Path-terminating elements. 
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Figure 12.5-8 STS-1 SONET frame, 9 rows X 90 bytes per row. 


link between a PTE and regeneration or regeneration and the ADM. A line defines 
the link between the PTE and the ADM. 

The frame structure is shown in Fig. 12.5-8. The data in the first three columns 
are transport overhead (TOH). In the TOH, the first three rows are section over- 
head (SOH), and the next six rows are line overhead (LOH). The data in the fourth 
column is path overhead (POH). The remaining 86 columns X 9 rows are for user 
data. The user data and path overhead constitute the synchronous payload enve- 
lope (SPE). Each of the overhead sections defines the addressing and multiplexing 
functions at their particular level. The SOH, LOH, and POH also contain informa- 
tion relating to framing, coding, synchronization, performance monitoring, and sta- 
tus. See Attp://www.webproforum.com/sonet for more specific information. 


Data Multiplexers 


Multiplexers for computer communication depart in two significant respects from 
general-purpose telecommunications service. On the one hand, complications arise 
from the fact that each computer has its own independent clock and operates in an 
asynchronous start/stop mode. On the other hand, simplifications come from the 
fact that computers don’t require the nearly instantaneous response needed for two- 
way voice communication. Consequently buffering is essential and the associated 
time delay is tolerable. 

Conventional communication links such as a telephone T1 line employ circuit 
switching in which a dedicated line is assigned to connect the source and destina- 
tion. This is the case whether the line is available at all times, or is shared by other 
users in a TDM scheme shown in Figs. 7.2-7 or 12.5—2. Consequently, circuit 
switching is not very efficient if the data are bursty or intermittent. 
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Message switching, or statistical time division multiplexing, on the other 
hand, only uses a channel when we want to send a specific message. After the mes- 
sage is sent the channel is relinquished for another message from the same or 
another source. Message switching is also called store and forward switching 
because when a particular message reaches a node, it may have to be stored until a 
connection to the next node is available. While store and forward switching is inher- 
ently more efficient than circuit switching, it may involve significant time delays— 
delays that would be unacceptable in two-way voice communication. But such 
delays are not a problem for applications like the Internet since Internet users can 
tolerate brief turnaround lags. Having said this, however, improvements in router 
technology (i.e., the device that determines the data route) have greatly reduced the 
latency time in store and forward switching, which is thus becoming an alternative 
to circuit switching for telephone systems. 

The first message switching system we'll consider is packet switching. Here 
the message length is limited to short blocks of data called packets. The packets 
may be sent using a dedicated line we call a virtual circuit such that when all the 
packets are sent, the line is given up. Virtual and switched circuits may be the same 
except that the virtual circuit is only used for the duration of the message. A packet 
has a header that contains the router information that specifies the location to be 
routed to. The packet also includes extra bits for error control purposes. Packet 
switching poses some interesting problems since, at any given time, the network 
contains numerous packets attempting to reach their respective destinations. With 
packet switching, multiple packets from the same source may take different paths 
depending on availability, may arrive in scrambled order, or may in fact get lost. If 
too many packets have entered a switch and compete for available lines and buffer 
space, we have a problem of contention. If the input rate to the switch exceeds the 
output rate, the excess packets are stored in a buffer. However, if the buffer is not 
large enough to store these excess packets, some may get discarded. If this occurs 
relatively frequently, the switch is said to be congested. 

Packet switching is capable of working at data rates up to 64 kbps. The rate is 
limited because much of the packet contains error control information. Frame relay 
switching is similar to packet switching, but uses variable length packets called 
frames. Much of the error bits have been removed and thus frame relay can achieve 
significantly faster data rates, up to 2 Mbps. Asynchronous transfer mode (ATM), 
or cell-relay, has even lower overhead than frame relay and uses fixed-length pack- 
ets called cells. An extremely broadband switching scheme with rates in the 10s and 
100s of Mbps, it allows for multiple virtual channels with the rate of each one 
dynamically set on demand. ATM is used in conjunction with BISDN and SONET 
to carry widely different services (voice, video, business information, etc.). 








kbps but, because of the intermittent nature of speech, 1s utilized only 10 percent of 
the time. We want to transmit a fax page consisting of 8 million bits. If we employ 
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message switching using all the channels that are available 90 percent of the time, 
our waiting time is 1/(24 X 64 kbps X 0.9) = 0.72 us/bit X 8 million bits = 

5.8 seconds. On the other hand, if we use circuit switching and only one of the TDM 
channels, our waiting time is 1/(64 kbps) = 16.6 us/bit X 8 million bits = 

125 seconds. Therefore, if we can tolerate the intermittent delays associated with 
store-and-forward switching, we have a relatively efficient method of quickly trans- 
ferring data. 
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In 1969, the Advanced Research Projects Agency (ARPA) of the US Department of 
Defense established the ARPANET, the acronym for Advanced Research Projects 
Agency NETwork, for the interconnection of computers and terminals. It was a 
packet-switched network. A model is shown in Fig. 12.6-1. Here the network con- 
sists of five computers, symbolized by circles, and four terminal interfaces, symbol- 
ized by squares. Each circle or square represents a communication node equipped 
with a concentrator having the ability to transmit, receive, and store data. The 
ARPANET evolved and has become what is commonly known as the Internet. 
The Internet has enabled the interconnection of millions of computers world- 
wide via telephone lines, satellite links, cable TV networks, RF (wireless), and other 
media. Its rapid growth in the last several years has been fueled by low-cost hardware 
connections, low-cost personal computers, and easy-to-use software interfaces in the 
form of email and web browsers. The available technology has brought the Internet 
into the communication mainstream just like the telephone 100 years ago. With 
increasing bandwidth on existing and new channels, the Internet will be able to 
deliver interactive video and other multimedia information (i.e., streaming audio and 
video), and thus may rival broadcast and cable TV for video/sound entertainment. 





Figure 12.6-1 Computer network. 
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There are several types of computer networks that can be interconnected via the 
Internet. Local area networks (LAN) allow computer systems in a building or 
within an enterprise to be connected together and share common databases, pro- 
grams, and other sources of information. Similarly, other networks cover a large 
area of the country and are called wide area networks (WAN). An example would 
be banking systems, in which branch computers and automatic teller machines are 
interconnected. Finally, there are global area networks (GAN), which enable 
worldwide network interconnection. In the remainder of this section we will look at 
two network models: the Open Systems Interconnection and the Transmission Con- 
trol Protocol/Internet Protocol. 


Open Systems Interconnection 


Open Systems Interconnection (OSI) was developed in the late 1970s by the Inter- 
national Standards Organization (ISO) in association with the International 
Telecommunications Union (ITU) to be a model for the standardization of computer 
networks. Open" means that any two systems can communicate if they adhere to 
the OSI model. 

Figure 12.6-2 shows the OSI model, which consists of seven layers. Layers 1 to 
3 are part of the network connection, while layers 4 to 7 are part of the data origina- 
tor (source) or recipient (destination). This architecture allows any one layer to be 
modified or improved without affecting the other layers. The layers are described as 
follows. 








Layer f Layer 

7 Application Application 7 

6 Presentation Presentation 6 

5 Session Computer Session 5 

ee 
4 Transport | Transport 4 
E — M —— à 

3 Network | Network 3 

2 Data link Network Data link 2 

i Physical Physical 1 
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Figure 12.6-2 OSi architecture. 

















12.6 Computer Networks 


1. Physical. This is where bit stream information is inserted into the network. 
Here we are directed to topics covered in Chapter 11 and the earlier part of this 
chapter. The physical layer includes electrical and mechanical standards (i.e., 
electrical signals, connectors, etc.). Examples include the Ethernet, CCITT 
1.430 BISDN, and the ISO-8877 ISDN connector. 


2. Data link. Structures the data into frames so that each one includes bits for 
synchronization, error detection/correction, and the beginning and end of a 
frame, plus the source’s and destination’s addresses. It also sends and receives 
acknowledgements if the frame is received correctly. For example, if a burst 
error occurred and a frame was corrupted, the software in the data link layer 
would cause the source to retransmit the frame. If an acknowledgment signal 
was not received by the source, the source would then attempt to send a duplicate 
frame. Therefore, the data link layer would minimize the problem of duplicate 
frames. The data link layer along with the other layers manages the data flow so 
a source with a high data rate does not overwhelm a slow destination. A data link 
layer example is the CCIT 1.441/Q.921 ISDN Data Link Protocol (LAPD). 


3. Network. Decides how the data is to be routed from node to node, and is used 
to manage data flow if there is network congestion. This layer may also be 
involved in sending and receiving acknowledgments. 


4. Transport. It either breaks the message data from the session layer into pack- 
ets (or some other unit) or takes the packets from the network layer and con- 
nects them to form a message. 


5. Session. Establishes connections between computers or reestablishes the con- 
nection in the event of a failure. It also links applications such as audio and 
video services and authenticates network users. 


6. Presentation. Performs data formatting function such as code conversion (e.g 


ot 


ASCII to EBCDIC), encryption/decryption, and data compression/decompression. 


7. Applications. This is the user interface to the OSI system and includes such 
services as file transfer programs, airline reservation data, email or web browsers. 


In practical systems, some or all of these layers can be integrated at various 
degrees into a single hardware device. For example, some personal computers have 
a plug-in modem card (i.e., stand-alone modem) that performs the functions of lay- 
ers 1-3 while the rest of the computer performs the functions of layers 4-7. On the 
other hand, some personal computers have a built-in modem that shares the com- 
puter’s CPU at a lower priority to perform the functions of layers 1-3, using a 
process known as cycle stealing. Of course, with this system, if there is a great deal 
of interaction with the network, the system slows down. 


Transmission Control Protocol/Internet Protocol 


The Transmission Control Protocol/Internet Protocol (TCP/IP) was developed in 
conjunction with the original ARPANET and preceded the ISO model. While the ISO 
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model has served as a guideline for network architectures and can certainly help us 
understand its functions, it has never become the standard its creators hoped it would. 
The TCP/IP system is not a formal standard like the OSI one, but was developed with 
the "implement as you go, experiment some more" philosophy that has become part 
of the Internet culture, versus the "specify first, implement later" approach taken by 
the ISO committee (Peterson and Davie, 2000). The TCP/IP is part of the Internet and 
preceded the OSI system. However, the TCP/IP can be implemented using the OSI 
structure, and some of the OSI layers are part of the TCP/IP system. A TCP/IP model 
is shown in Fig. 12.6—3. The application and physical layers are similar to the OST 
model with the other ones described as follows. 


1. Network. Manages the exchange of data between the computer and the 
attached network. 


2. Internet. Causes the data to be transferred across the network or, using a 
router, sends the data to a different network. A router is a processor that trans- 
fers data from one network to another as it moves from source to destination. A 
router also determines the best path for data to travel. As Fig. 12.6—3 shows, the 
Internet layer enables the linking of multiple networks where each one may have 
a different technology and data format. As an aside, a bridge is a device that per- 
forms a similar function as a router. However, a bridge links like networks 
together and thus is much simpler than a router. For example, a bridge could link 
systems that conform to IEEE 802.3 such as an Ethernet and token ring LANs. A. 
repeater can also be used to link networks, but only if the network and data for- 







Application | Application 








Network #1 Network #2 


Physical #1 





Physical #2 


Figure 12.6-3 TCP/IP architecture for two networks. 
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mat are completely identical, as in the linking of two Ethernets, for example. A 
repeater makes two networks look like one and therefore is usually found in sys- 
tems where, due to cable length restrictions, a network has to be split into two 
separate pieces. A firewall is a router that is programmed to prevent unautho- 
rized access to a given site or to a section of a given site. This is in addition to 
any encryption that individual users may already employ. Firewalls are a rela- 
tively centralized security system. For example, a bank may have a firewall pre- 
venting unauthorized access to certain parts of its site, but allowing individual 
le depositors access to their accounts using personalized encryption. 





U 


TCP. This is also called the host-to-host or transport layer. It numbers each 
data segment, so it can be put in the correct sequence at the source; at the desti- 
nation, it puts the data in the correct sequence, so it is reliably delivered to the 
application. If reliability is not a major concern, an alternative logical channel 
to data delivery between the IP and network layers is the user datagram pro- 
tocol (UDP) for the transfer of single packet messages we call datagrams. The 
UDP has the advantage of lower overhead than the TCP channel. 


You will note that in contrast to the OSI system, except for the physical and 
application layers, there is not a great deal of similarity between the OSI and TCP/IP 
layers. OSI defines a rigid set of requirements that specifies what function is done in 
each layer, whereas TCP/IP primarily defines a method for exchanging data 
between different networks. Generally, there is no rigid requirement that a given 
function such as error correction and detection reside in any one particular layer. 

The subject of computer networks is an entire study in itself. More detailed 
information with respect to analysis and design of computer networks can be found 
in Tannenbaum (1989), Stallings (2000), Miller (2000), and Peterson and Davie 
(2000). 


12.7 PROBLEMS 


12.1-1* An analog waveform with W = 15 kHz is to be quantized to q = 200 levels and 
transmitted via an M-ary PCM signal having M — 2". Find the maximum allowed 
values of v and f, and the corresponding value of n when the available transmission 
bandwidth is By = 50 kHz. 


12.1-2 Do Prob. 12.1-1 with By = 80 kHz. 


12.1-3 A Hyperquantization is the process whereby N successive quantized sample values are 
represented by a single pulse with q" possible values. Describe how PCM with 
hyperquantization can achieve bandwidth compression, so By < W. 


12.1-4 Suppose the PCM quantization error e, is specified to be no greater than +P percent 
of the peak-to-peak signal range. Obtain the corresponding condition on v in terms 
of M and P. 
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A voice signal having W = 3 kHz and S, = 1/4 is to be transmitted via M-ary 
PCM. Determine values for M, v, and f, such that (5/N)5 = 40 dB if Bp = 16 kHz. 


Do Prob. 12.1-5 with (S/N )p = 36 dB and B; = 20 kHz. 


An audio signal with S, — 0.3 is to be transmitted via a PCM system whose param- 
eters must satisfy the standards for broadcast-quality audio transmission listed in 
Table 9.4—1. (a) If M = 2, then what are the required values of v and B}? (b) If 
Br = 4W, then what's the minimum value of M? 

Do Prob. 12.1-7 for high-fidelity audio transmission standards. 


What is the minimum number of bits required for an ADC to quantize a signal that 
varies from 5 uV to 200 mV so that (S/N), = 40 dB? 


What is the q, A and (S/N), for a PCM system with a v = 12-bit ADC and a bipo- 
lar +10 sinusoidal input? 


Do Prob. 12.1-10 with v — 16 bits. 


How many bits are required for an ADC to encode music where the dynamic range 
is 120 dB? 


What is the minimum size memorv required to store 10 minutes of a sampled and 
quantized voice assuming S,/o7 = 35 dB and f, = 8 kHz. 


Consider a uniform quantizer with v = 12 bits and inputs between 0 /—10 V. 
What is e; for an input of (a) + 0.02 volts, (b) +0.2 volts? 


Consider the signal PDF in Fig. P12.1—15. Use Eq. (9) to show that uniform quan- 
tization with q = 2” levels and v = 3 yields e} = 1/3q*. Then calculate 5, and find 
(S/N )p in terms of q. 


Px) 











Figure P12.1-15 


Consider a signal whose PDF is reasonably smooth and continuous and satisfies the 
condition P[|x(f)| > 1] << 1. From Eq. (9), show that uniform quantization with 
q >> 1 yields o2 ~ 1/3q*. 

Let x(t) have a zero-mean gaussian PDF with variance c? such that 
P[|x(t)| > 1] = 0.01. Use the result from Prob. 12.1-16 to obtain (S/N)p in a form 
like Eq. (7), assuming uniform quantization with q > 1. 
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Let x(f) have a Laplace PDF, as given by Eq. (13), with variance o? such that 
P[ |x(t)| > 1] = 0.01. Use the result from Prob. 12.1—16 to obtain (S/N )p in a form 
like Eq. (7), assuming uniform quantization with g > 1. 


Consider the compressor characteristic z(x) = (sgn x) V |x| for |x| = 1. (a) Find 
and sketch the complementary expander characteristic xa). (b) Evaluate K, when 
x(t) has the PDF in Fig. P12.1-15. 


Consider the u-law compressor in Example 12.1-4. (a) Derive the complementary 
expander characteristic x(z). (b) Carry out the details leading to K, as given in Eq. (15). 


Repeat Prob. 12.1—15 if the quantizer is preceded by a u = 255 logarithmic com- 
pandor. 


A voice signal with a Laplace distribution is applied to a compressor having 
z(x) = (sgn x)(1 — e 25) for |x| = 1. (a) Obtain an expression for K, in terms of 
a. Then show that K, = 1/9 fora >> 1 and express (S/N )p in dB in terms of v and 
S, for binary voice PCM with this compressor. (b) Now take g = 2? and evaluate 
S,, K, and (S/N)5 for a = 4, 8, and 16. Use these results to make a plot like Fig. 
12.1-8. Your plot should demonstrate that this compressor yields a higher (S/N )p 
but less dynamic range than the standard w-law compressor. 


The A-law companding system employs a compressor with 


RS ae + In A) 0zxxl1l/A 
LOG Nita InAx/(] - mA) WA<x<1 


and z(x) = —z(x) for —1 = x < 0. (a) Assume py(x) has even symmetry and neg- 
ligible area outside |x| = 1 to show that 


VA, | 
K,=(1 + ina), + 2 (4 = x? px) dx 
0 


(b) Obtain the expression for K, in terms of A and a when x(t) has a Laplace distri- 
bution, and show that K, ^ (1 + 1n A)?/A? when a > A. (c) Let A = 100 and 
q = 28. Evaluate S,, K, and (S/N)p for a = 4, 16, 64 and a >> 100. Use these 
results to make a plot like Fig. 12.1-8. Your plot should demonstrate that A-law 
companding yields a lower (S/N)p but greater dynamic range than standard u-law 
companding. 

A signal with S, = 1/2 and W = 6 kHz is to be transmitted via M-ary PCM on a 
channel having Ny = 0.01 wW/Hz and By = 15 kHz. Find the smallest value of M 
and the corresponding smallest value of v that yields (S/N)5 = 36 dB. Then calcu- 
late the minimum value of Sp for operation above threshold, assuming the PCM sig- 
nal occupies the full transmission bandwidth. 


Do Prob. 12.2-1 with By = 20 kHz. 
Do Prob. 12.2-1 with B; = 50 kHz. 
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Consider a voice PCM system with M = 2,v = 8, and u-law companding, so 
(S/N)p = 37 dB. The PCM signal is transmitted via a regenerative repeater system 
with 20 identical sections. Find the minimum value of y at the input to the first 
repeater for operation above threshold. Then determine the PCM advantage in dB by 
calculating the value of y at the input to the first repeater of an analog repeater sys- 
tem that yields (S/N), = 37 dB after 20 sections. 


Do Prob. 12.2-4 with 100 sections. 


Show from Eq. (3) that decoding noise decreases (S/N)p by 1 dB when 
P, = 1/15q’. Calculate the corresponding threshold value of y for binary PCM in 
gaussian noise when v = 4, 8, 12. Then take S, = 1 and plot (S/N )p in dB versus 
yg in dB. 


Consider binary PCM with sign/magnitude codewords. Figure 12.1—1b reveals that 
an error in the sign bit shifts the decoded level by an amount *2(2i — 1)/g where 
i= 1,2,..., q/2. Show that €2, = (Sq? — 8)/3vq?, so e}, = 5/3v if q > 1. 


Suppose a PCM system has fixed values of y and b = B,/W. Taking threshold into 
account, obtain an expression for the maximum value of q. 


Find and plot y, in dB versus b = Bz/W for a PCM system with q = 256 and v = b. 


What A minimizes slope overload for a DM system where the input is a normalized 
sinusoid, given f, = 30 kHz and W = 3 kHz? 


What is the maximum amplitude for a 1 KHz sinusoidal input for a DM system that 
prevents slope-overload that has been sampled at 10 times the Nyquist rate with 
A = 0.117 V? 


The signal x(t) = 8 cos 27 Wt is the input to a delta modulator with T, = 1/24W. 
Plot the sample points x(kT,) for 0 = k = 30. Then take x (0) = Q and plot x (t) 
with A = land A — 3. 


Use the results of Example 12.3-1 to plot (S/N )p in dB versus b for DM voice trans- 
mission with b = 4, 8, and 16. Compare with binary PCM transmission, taking 
v = band S, = 1/30. 

Use the results of Example 12.3—1 to tabulate f, and the optimum value of A for DM 
voice transmission with b = 4, 8, and 16 when S, = 1/9. 


Suppose a signal to be transmitted via DM has a dominant spectral peak at 
f = fo < W, so that |x(t)| nox = 27f. Obtain an upper bound on (S/N )p in a form 
like Eq. (12). 

The power spectrum of a lowpass filtered voice signal can be approximated as 


Gf) = [K/(f% + f£?) ]U(f/2W). Find K in terms of S, and obtain an expression 
for the rms bandwidth. Evaluate Wms when W = 4 kHz and f} = 0.8 kHz. 
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An approximate expression for DM slope-overload noise is given by Abate (1967) 
as 


87? / Was V. 
No = Sr (Tæ) Bsa Dens 


Write A?/f, in Eq. (7) in terms of s and b, and show that the total quantization noise 
N; + N; is minimized by taking s = 1n 2b. 
Find the tap gains and evaluate the prediction gain in dB for DPCM with a one-tap 


and a two-tap transversal filter when the input is a voice signal having p, = 0.8 and 
P2 7 0.6. 


Do Prob. 12.3-9 for a TY image signal having p, = 0.95 and p, = 0.90. 


Consider a DPCM system with a two-tap transversal prediction filter. Assuming 


q >> 1 so that x,(kT,) ~ x(kT,), find the tap gains to implement the prediction 
strategy 


e dx(t) 
x4(kT,) ec x[(k = 1)7,] + ae i=(k- 1T, 

Consider a DPCM system with a one-tap transversal prediction filter and q => 1, so 

X(KT,) ^ x{kT,) and e,(kT,) ~ x(kT,) — X,(kT,). Obtain an expression for the 

mean square prediction error e? = E[e2(k)] in terms of S, and the signal's autocor- 

relation R(T) = E[x(t)x(t — 7). Then find the value of the tap gain that minimizes 

e. 

Use the method outlined in Prob. 12.3—12 to find the tap gains for a two-tap trans- 

versal prediction filter. Express your results in a matrix like Eq. (165). 

How many bits can a CD store? 

What percentage of a CD will it take to store the Bible if it consists of 981 pages, 

two columns per page, 57 lines per column, 45 characters per line, and each charac- 

ter has 7 bits? 


How many minutes of music can be recorded on a 2-Gbyte hard drive with f, = 44.1 
kHz, v — 16 bits, and two recording channels? 


Several high-fidelity audio channels having W = 15 kHz are to be transmitted via 
binary PCM with v — 12. Determine how many of the PCM signals can be accom- 


modated by the first level of the AT&T multiplexing hierarchy. Then calculate the 
corresponding bandwidth efficiency. 


Do Prob. 12.5—1 for the first level of the CCIT hierarchy. 


Determine the number of voice telephone signals that can be carried by a STS-1 
SONET. 
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12.5-4* How long does it take to transmit an 8 X 10 inch image with 600 dots per inch res- 
olution and 1 bit per dot over a BRI channel? 


12.5-5 Repeat Prob. 12.5-4 with a 56-kbps modem. 


chapter 


13 


Channel Coding and Encryption 


CHAPTER OUTLINE 


13.4 


Error Detection and Correction 


Repetition and Parity-Check Codes Interleaving Code Vectors and Hamming Distance FEC Systems 
ARQ Systems 


Linear Block Codes 

Matrix Representation of Block Codes Syndrome Decoding Cyclic Codes M-ary Codes 
Convolutional Codes 

Convolutional Encoding Free Distance and Coding Gain Decoding Methods Turbo Codes 
Data Encryption 

Data Encryption Standard Rivest-Shamir-Adleman System 


547 





548 CHAPTER 13. € Channel Coding and Encryption 


primary objective of transmitting digital information is to minimize errors and, in some cases, to maintain data 

security. Transmission errors in digital communication depend on the signalto-noise ratio. If a particular system 
has a fixed value of S/N and the error rate is unacceptably high, then some other means of improving reliability must 
be sought. Errorcontrol coding often provides the best solution. 

The tremendous expansion of electronic commerce (e-commerce) has been made possible by improved and 
economical data security systems. Data security has three goals: secrecy, to prevent unauthorized eavesdropping; 
authenticity, to verify the sender's signature and prevent forgery; and finally integrity, to prevent message alter- 
ation by unauthorized means. All three of these goals require a system of coding called encryption. 

Errorcontrol coding involves the systematic addition of extra digits to the transmitted message. These extra check 
digits convey no information by themselves, but make it possible to detect or correct errors in the regenerated mes- 
sage digits. In principle, information theory holds out the promise of nearly errorless transmission, as will be discussed 
in Chap. 16. In practice, we seek some compromise between conflicting considerations of reliability, efficiency, and 
equipment complexity. A multitude of error-control codes have therefore been devised to suit various applications. 

Encryption codes, on the other hand, are used to scramble a message by diffusion and/or confusion and 
thereby prevent unauthorized access. Diffusion is where the message is spread out, thus making it more difficult for 
the enemy to look for patterns and use other statistical methods to decode, or decipher, the message. Confusion is 
where the cryptographer uses a complex set of transformations to hide the message. Obviously the goal is to create 
a ciphertext that is unconditionally secure such that no amount of computing power can decipher the code. However, 
practical systems are computationally secure, meaning it would take a certain number of years to break the cipher. 

This chapter starts with an overview of errorcontrol coding, emphasizing the distinction between error detec- 
tion and error correction and systems that employ these strategies. Subsequent sections describe the two major 
types of codes implementations, block codes and convolutional codes. We will then present the basic ideas 
behind data encryption and decryption and look at the widely used data encryption standard (DES) and the 
Rivest-Shamir-Adleman (RSA) system. We'll omit formal mathematical analysis. Detailed treatments of error 
control coding and encryption are provided by the references cited in the supplementary reading list. 


OBJECTIVES 


After studying this chapter, and working the exercises, you should be able to do each of the following: 


l. Explain how parity checking can be used for error detection or correction, and relate the error-control properties 
of a code to its minimum distance (Sect. 13.1). 

2. Explain how interleaving codewords can be used to make error correction and detection methods more effective 
(Sect. 13.1). 


3. Calculate the message bit rate and error probability for a forward error correction (FEC) system with a given 
block code (Sect. 13.1). 


Analyze the performance of an ARQ system with a given block code (Sect. 13.1). 
Describe the structure of a systematic linear block code or cyclic code (Sect. 13.2). 


Use matrix or polynomial operations to perform encoding and decoding operations of a given code (Sect. 13.2). 
Describe the operation of convolutional codes (Sect. 13.3). 


Se Noose 


Describe the basic concepts of data secret-key and public-key encryption systems, and encrypt data using the 
RSA encryption algorithm (Sect. 13.4). 
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13.1 ERROR DETECTION AND CORRECTION 


Coding for error detection, without correction, is simpler than error-correction cod- 
ing. When a two-way channel exists between source and destination, the receiver 
can request retransmission of information containing detected errors. 

This error-control strategy, called automatic repeat request (ARQ), particu- 
larly suits data communication systems such as computer networks. However, when 
retransmission is impossible or impractical, error control must take the form of for- 
ward error correction (FEC) using an error-correcting code. Both strategies will 
be examined here, after an introduction to simple but illustrative coding techniques. 


Repetition and Parity-Check Codes 


When you try to talk to someone across a noisy room, you may need to repeat your- 
self to be understood. A brute-force approach to binary communication over a noisy 
channel likewise employs repetition, so each message bit is represented by a code- 
word consisting of n identical bits. Any transmission error in a received codeword 
alters the repetition pattern by changing a 1 to a 0 or vice versa. 

If transmission errors occur randomly and independently with probability P, = 


a, then the binomial frequency function gives the probability of i errors in an n-bit 
codeword as 


P(i,n) — (") a(1— ay! [1a] 
ni , 
= (") at a< i 
i 


(") ~ TE "TN metn [15] 


where 


We'll proceed on the assumption that œ << 1—which does not necessarily imply 
reliable transmission since œ = 0.1 satisfies our condition but would be an unac- 
ceptable error probability for digital communication. Repetition codes improve reli- 
ability when « is sufficiently small that P(i + 1, n) < P(i, n) and, consequently, 
several errors per word are much less likely than a few errors per word. 

Consider, for instance, a triple-repetition code with codewords 000 and 111. All 
other received words, such as 001 or 101, clearly indicate the presence of errors. 
Depending on the decoding scheme, this code can detect or correct erroneous 
words. For error detection without correction, we say that any word other than 000 
or 111 is a detected error. Single and double errors in a word are thereby detected, 
but triple errors result in an undetected word error with probability 


P,. = P(3,3) = a 
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For error correction, we use majority-rule decoding based on the assumption that at 
least two of the three bits are correct. Thus, 001 and 101 are decoded as 000 and 
111, respectively. This rule corrects words with single errors, but double or triple 
errors result in a decoding error with probability 


P = P(2, 3) + P(3,3) = 3a? — 2o 


Since P, = a would be the error probability without coding, we see that either 
decoding scheme for the triple-repetition code greatly improves reliability if, say, 
a = 0.01. However, this improvement is gained at the cost of reducing the message 
bit rate by a factor of 1/3. 

More efficient codes are based on the notion of parity. The parity of a binary 
word is said to be even when the word contains an even number of 1s, while odd par- 
ity means an odd number of 1s. The codewords for an error-detecting parity-check 
code are constructed with n — 1 message bits and one check bit chosen such that all 
codewords have the same parity. With n = 3 and even parity, the valid codewords are 
000, 011, 101, and 110, the last bit in each word being the parity check. When a 
received word has odd parity, 001 for instance, we immediately know that it contains 
a transmission error—or three errors or, in general, an odd number of errors. Error cor- 
rection is not possible because we don’t know where the errors fall within the word. 
Furthermore, an even number of errors preserves valid parity and goes unnoticed. 

Under the condition æ << 1, double errors occur far more often than four or more 


errors per word. Hence, the probability of an undetected error in an n-bit parity-check 
codeword is 


— 1 
Pye ~ P2, n) ~ T o [2] 





For comparison purposes, uncoded transmission of words containing n — 1 mes- 
sage bits would have 


Pawe = 1 = POn — 1) ~ (n — 1)a 


Thus, if n = 10 and « = 1073, then P,,, = 107? whereas coding yields P pe = 5 X 
1075 with a rate reduction of just 9/10. These numbers help explain the popularity of 
parity checking for error detection in computer systems. 

As an example of parity checking for error correction, Fig. 13.1—1 illustrates an 
error-correcting scheme in which the codeword is formed by arranging k message 
bits in a square array whose rows and columns are checked by 2k parity bits. A 
transmission error in one message bit causes a row and column parity failure with 


the error at the intersection, so single errors can be corrected. This code also detects 
double errors. 


Interleaving 


Throughout the foregoing discussion we've assumed that transmission errors appear 
randomly and independently in a codeword. This assumption hold for errors caused 
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Figure 13.1-1 Square array for error correction by parity checking. 


by white or filtered white noise. But impulse noise, produced by lightning and 
switching transients, causes errors to occur in bursts that span several successive 
bits. Burst errors also appear when radio-transmission systems suffer from rapid 
fading or if the channel has memory where a given data symbol is a function of pres- 
ent and past symbols. Such multiple errors wreak havoc on the performance of con- 
ventional codes and must be combated by special techniques. Conventional error 
control methods such as parity checking are designed for errors that are isolated or 
statistically independent events. 

One solution is to spread out the transmitted codewords using a system of 
interleaving as represented in Fig. 13.1—2. Here the message bits are dispersed with 
the curved line connecting the original message and parity bit sequence in one par- 
ity word. These check bits enable us to check for single errors. Consider the case of 
a system that can only correct single errors. An error burst occurs such that several 
successive bits have been corrupted. If this happens to the original bit sequence, the 
system would be overwhelmed and unable to correct the problem. However, let's 
say the same error burst occurs in the interleaved transmission. In this case succes- 
sive bits from different message words have been corrupted. When received, the bit 
sequence 1s reordered to its original form and then the FEC can correct the faulty 
bits. Therefore, our single error correction system is able to fix several errors. 





m m m C m m m C m m m C m m m c n e &: 8 


Figure 13.1-2 Interleaved check bits for error control with burst errors. 
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Code Vectors and Hamming Distance 


Rather than continuing a piecemeal survey of particular codes, we now introduce a 
more general approach in terms of code vectors. An arbitrary n-bit codeword can be 
visualized in an n-dimensional space as a vector whose elements or coordinates equal 
the bits in the codeword. We thus write the codeword 101 in row-vector notation as X — 
(10 1). Figure 13.1—3 portrays all possible 3-bit codewords as dots corresponding to the 
vector tips in a three-dimensional space. The solid dots in part (a) represent the triple- 
repetition code, while those in part (5) represent a parity-check code. 

Notice that the tripie-repetition code vectors have greater separation than the 
parity-check code vectors. This separation, measured in terms of the Hamming dis- 
tance, has direct bearing on the error-control power of a code. The Hamming dis- 
tance d(X, Y) between two vectors X and Y is defined to equal the number of differ- 
ent elements. For instance, i£ X = (1 0 1) and Y = (1 1 0) then d(X, Y) = 2 because 
the second and third elements are different. 

The minimum distance d, of a particular code is the smallest Hamming dis- 
tance between valid code vectors. Consequently, error detection is always possible 
when the number of transmission errors in a codeword is less than d, so the erro- 
neous word is not a valid vector. Conversely, when the number of errors equals or 
exceeds dmim» the erroneous word may correspond to another valid vector and the 
errors cannot be detected. 

Further reasoning along this line leads to the following distance requirements 
for various degrees of error control capability: 


Detect up to € errors per word dy m €61 [3a] 
Correct up to t errors per word dan Z D6 tk [3b] 
Correct up to t errors and detect € > terrors per word fie ee fq 1 Bq 





(a) 


Figure 13.1-3 Vectors representing 3-bit codewords. (a) Triple-repetition code; (b) parity- 
check code. 
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By way of example, we see from Fig. 13.1-3 that the triple-repetition code has 
dain = 3. Hence, this code could be used to detect € = 3 — 1 = 2 errors per word or 
to correct ¢ = (3 — 1)/2 = 1 error per word—in agreement with our previous 
observations. A more powerful code with dmn = 7 could correct triple errors or it 
could correct double errors and detect quadruple errors. 

The power of a code obviously depends on the number of bits added to each 
codeword for error-control purposes. In particular, suppose that the codewords con- 
sist of k < n message bits and n — k parity bits checking the message bits. This 
structure is known as an (n, k) block code. The minimum distance of an (n, k) block 
code 1s upper-bounded by 


ain =n k tl [4] 
and the code's efficiency is measured by the code rate 
Re = k/ n [5] 


Regrettably, the upper bound in Eq. (4) is realized only by repetition codes, which 
have k = 1 and very inefficient code rate R, = 1/n. Considerable effort has thus 
been devoted to the search for powerful and reasonably efficient codes, a topic we'll 
return to in the next section. 


FEC Systems 


Now we're prepared to examine the forward error correction system diagrammed in 
Fig. 13.1—4. Message bits come from an information source at rate r,. The encoder 
takes blocks of k message bits and constructs an (n, k) block code with code rate 
R, = k/n < 1. The bit rate on the channel therefore must be greater than r,, namely 


r= (n/k)r, = r/R: [6] 


The code has dnin = 2t + 1 x n — k + 1, and the decoder operates strictly in an 
error-correction mode. 
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Figure 13.1-4 FEC system. 
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We'll investigate the performance of this FEC system when additive white 
noise causes random errors with probability a << 1. The value of « depends, of 
course, on tbe signal energy and noise density at the receiver. If E, represents the 
average energy per message bit, then the average energy per code bit is RE, and the 
ratio of bit energy to noise density is 


Ye = R.E,/No = Rey, [7] 


where y, = E,/N,. Our performance criterion will be the probability of output 
message-bit errors, denoted by P, to distinguish it from the word error probability P,e. 

The code always corrects up to t errors per word and some patterns of more than 
t errors may also be correctable, depending upon the specific code vectors. Thus, the 
probability of a decoding word error is upper-bounded by 


Pe X Phi n) 
i=f+] 


For a rough but reasonable performance estimate, we'll take the approximation 


Po P(ttln)e 1 n : a't! [8] 


which means that an uncorrected word typically has t + 1 bit errors. On the average, 
there will be (k/n)(t -- 1) message-bit errors per uncorrected word, the remaining 
errors being in check bits. When Nk bits are transmitted in N >> 1 words, the 
expected total number of erroneous message bits at the output is (k/n)(t + 1)NP we. 


Hence, 
ttl e 
Pie "ES Pre ea (C Jan [9] 
n t 





in which we have used Eq. (1b) to combine (t + 1)/n with the binomial coefficient. 
Tf the noise has a gaussian distribution and the transmission system has been 
optimized (i.e., polar signaling and matched filtering), then the transmission error 

probability is given by 
a= Q(V2x.) = (VR) (10) 


aU (AmR,y,) Pe UE Rey, = 5 


The gaussian tail approximation invoked here follows from Eq. (9), Sect. 8.4, and is 
consistent with the assumption that o << 1. Thus, our final result for the output error 
probability of the FEC system becomes 


Py = (^ 7 tov y ij 


" 4 = Jen yr eerie 
t 








13.1 Error Detection and Correction 


Uncoded transmission on the same channel would have 
Pu = Q(va v y) = (4T Yb) y We oe [12] 


since the signaling rate can be decreased from r,/R, to ry. 

À comparison of Eqs. (11) and (12) brings out the importance of the code 
parameters £ = (dnin — 1)/2 and R, = k/n. The added complexity of an FEC sys- 
tem is justified provided that t and R, yield a value of P,, significantly less than Pibe- 
The exponential approximations show that this essentially requires (t+ 1)R, > 1. 
Hence, a code that only corrects single or double errors should have a relatively high 
code rate, while more powerful codes may succeed despite lower code rates. The 
channel parameter y, also enters into the comparison, as demonstrated by the fol- 
lowing example. 


Um DE ds 11) block odad = 3, sot = Edk. = 11/15. An 


FEC system using this code would have o = = ol V (22/15), | and P, = 140°, 
whereas uncoded transmission on the same channel would yield Puse = Q( V 2y,). 
These three probabilities are plotted versus y, in dB in Fig. 13.1-5. If y, > 8 dB, we 
see that coding decreases the error probability by at least an order of magnitude 
compared to uncoded transmission. At y, = 10 dB, for instance, uncoded transmis- 
sion yields Pse = 4 X 107° whereas the FEC system has P,, ^ 107’ even though 
the higher channel bit rate increases the transmission error probability to a = 
6 x 105. 
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Figure 13.1-5 Curves of error probabilities in Example 13.1-1 
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If y, « 8 dB, however, coding does not significantly improve reliability and 
actually makes matters worse when y, < 4 dB. Furthermore, an uncoded system 
could achieve better reliability than the FEC system simply by increasing the signal- 
to-noise ratio about 1.5 dB. Hence, this particular code doesn't save much signal 
power, but it would be effective if y, has a fixed value in the vicinity of 8-10 dB. 





EXERCISE 13.1-1 Suppose the system in Example 13.1-1 is operated at y, = 8 dB so a = 0.001. Evalu- 
ate P(i, n) for i = 0, 1, 2, and 3. Do your results support the approximation in Eq. (8)? 





ARQ Systems 


The automatic-repeat-request strategy for error control is based on error detection 
and retransmission rather than forward error correction. Consequently, ARQ sys- 
tems differ from FEC systems in three important respects. First, an (n, k) block code 
designed for error detection generally requires fewer check bits and has a higher k/n 
ratio than a code designed for error correction. Second, an ARQ system needs a 
return transmission path and additional hardware in order to implement repeat trans- 
mission of codewords with detected errors. 'Third, the forward transmission bit rate E 
must make allowance for repeated word transmissions. The net impact of these dif- 3 
ferences becomes clearer after we describe the operation of the ARQ system repre- jj 
sented by Fig. 13.1—-6. 3 
Each codeword constructed by the encoder is stored temporarily and transmit- 
ted to the destination where the decoder looks for errors. The decoder issues a posi- 7 
tive acknowledgment (ACK) if no errors are detected, or a negative acknowledg- 3 
ment (NAK) if errors are detected. A negative acknowledgment causes the input r 
controller to retransmit the appropriate word from those stored by the input buffer. A. 
particular word may be transmitted just once, or it may be transmitted two or more 
times, depending on the occurrence of transmission errors. The function of the out- 
put controller and buffer is to assemble the output bit stream from the codewords 
that have been accepted by the decoder. 





































Input buffer Forward Output buffer 
— Encoder m> and ou Decoder and 
transmission 
controller controller 
Return ACK/NAK 
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Figure 13.1-6 ARQ system. 
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Compared to forward transmission, return transmission of the ACK/NAK sig- 
nal involves a low bit rate and we can reasonably assume a negligible error proba- 
bility on the return path. Under this condition, all codewords with detected errors are 
retransmitted as many times as necessary, so the only output errors appear in words 
with undetected errors. For an (n, k) block code with dwin = € + 1, the corresponding 
output error probabilities are 


Py. = >, Plin) = P(e + 1n) ~( i Jet" [13] 


iE £401 


(1 E 
Pre = ie P = ( ? Jet" [14] 


which are identical to the FEC expressions, Eqs. (8) and (9), with € in place of r. 
Since the decoder accepts words that have either no errors or undetectable errors, the 
word retransmission probability is given by 





p] [pU n) + Pwe] 
But a good error-detecting code should yield P,, << P(0, n). Hence, 
p*1-P(0n)21-(1—2a)-n« [15] 


where we've used the approximation (1 — a)" = 1 — na based on na < 1. 

As for the retransmission process itself, there are three basic ARQ schemes 
illustrated by the timing diagrams in Fig. 13.1—7. The asterisk marks words received 
with detected errors which must be retransmitted. The stop-and-wait scheme in part - 
a requires the transmitter to stop after every word and wait for acknowledgment 
from the receiver. Just one word needs to be stored by the input buffer, but the trans- 
mission time delay ¢, in each direction results in an idle time of duration D = 2t, 
between words. Idle time 1s eliminated by the go-back-N scheme in part b where 
codewords are transmitted continuously. When the receiver sends a NAK signal, the 
transmitter goes back N words in the buffer and retransmits starting from that point. 
The receiver discards the N — 1 intervening words, correct or not, in order to pre- 
serve proper sequence. The selective-repeat scheme in part c puts the burden of 
sequencing on the output controller and buffer, so that only words with detected 
errors need to be retransmitted. 

Clearly, a selective-repeat ARQ system has the highest throughput efficiency. 
To set this on a quantitative footing, we observe that the total number of transmis- 
sions of a given word is a discrete random variable m governed by the event proba- 
bilities P(m = 1) = 1 — p, P(m = 2) = p(1 — p), and so on. The average number 
of transmitted words per accepted word is then 


m= 1(1 — p) + 2p(1 - p) -3p1—- p) * > [16] 


(1— pl + 2p + 3p? + =) = —— 
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ARQ schemes. (a) Stop-and-wait; (b] go-back-N; (c} selective-repeat. 


since 1 + 2p + 3p? + + = (1 — p) ?. On the average, the system must transmit 
nm bits for every k message bits, so the throughput efficiency is 
k k 
R. -——--(ü- 17 
(we cp) [17] 


in which p = na, from Eq. (15). 


We use the symbol R; here to reflect the fact that the forward-transmission bit 
rate r and the message bit rate r, are related by 


r = HR, 


comparable to the relationship r = r,/R, in an FEC system. Thus, when the noise 
has a gaussian distribution, the transmission error probability o is calculated from 
Eq. (10) using R; instead of R, = k/n. Furthermore, if p << 1, then R} = k/n. But 
an error-detecting code has a larger k/n ratio than an error-correcting code of equiv- 
alent error-control power. Under these conditions, the more elaborate hardware 
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needed for selective-repeat ARQ may pay off in terms of better performance than an 
FEC system would yield on the same channel. 

The expression for m in Eq. (16) also applies to a stop-and-wait ARQ system. 
However, the idle time reduces efficiency by the factor 7,/(T,, + D) where D = 2t, 
is the round-trip delay and 7, is the word duration given by T, = n/r x k/r,. 
Hence, 

"mS: l—-p z k l—p 


* nl-(D/T,) 21+ (2tyrs/k) 





[18] 


in which the upper bound comes from writing D/T,, = 2t,r,/k. 
A. go-back-N ARQ system has no idle time, but N words must be retransmitted 
for each word with detected errors. Consequently, we find that 





N 
mmn queue [19] 
Lap 
and 
k l — k E 
gms A E Piece [20] 
n]—-p-ctNp n1-—p+ (2tyr,/k)p 


where the upper bound reflects the fact that N = 2t,/T,,. 

Unlike selective-repeat ARQ, the throughput efficiency of the stop- -and- wait 
and go-back-N schemes depends on the round-trip delay. Equations (18) and (20) 
reveal that both of these schemes have reasonable efficiency if the delay and bit rate 
are such that 2tzr, << k. However, stop-and-wait ARQ has very low efficiency 
when 2t,r, = k, whereas the go-back-N scheme may still be satisfactory provided 
that the retransmission probability p is small enough. 

Finally, we should at least describe the concept of hybrid ARQ systems. These 
systems consist of an FEC subsystem within the ARQ framework, thereby combin- 
ing desirable properties of both error-control strategies. For instance, a hybrid ARC 
system might employ a block code with dmin = t + € + 1, so the decoder can correct 
up to £ errors per word and detect but not correct words with £ > t errors. Error cor- 
rection reduces the number of words that must be retransmitted, thereby increasing 
the throughput without sacrificing the higher reliability of ARQ. 
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n — 10, and 4 — 1. The transmission channel is corrupted by gaussian noise and we 
seek the value of y, needed to get Pj, ^ 10 ?. Equation (14) yields the required 
transmission error probability œ = (1075/9)! = 1.1 x 1073, and the corresponding 
word retransmission probability in Eq. (15) is p ~ 10« << 1. Hence, the throughput 
efficiency will be R} ~ k/n = 0.9, from Eq. (17). Since œ = o( 2R'y, ) we call 
upon the plot of Q in Table T.6 to obtain our final result y; = 3.17/1.8 = 5.3 or 7.3 
dB. As a comparison, Fig. 13.1-5 shows that uncoded transmission would have 
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Pre 79 6 X 1074 if y, = 7.3 dB and requires y, 7 9.6 dB to get P,,, = 1075. The 
ARQ system thus achieves a power saving of about 2.3 dB. 





Assume that the system in Example 13.1—2 has 7, = 50 kbps and t, = 0.2 ms. By 
calculating R, show that the go-back-N scheme would be acceptable but not the 
stop-and-wait scheme when channel limitations require r = 100 kbps. 


EXERCISE 13.1-2 











13.2 LINEAR BLOCK CODES 


This section describes the structure, properties, and implementation of block codes. 
We start with a matrix representation of the encoding process that generates the 
check bits for a given block of message bits. Then we use the matrix representation 
to investigate decoding methods for error detection and correction. The section 
closes with a brief introduction to the important class of cyclic block codes. 


Matrix Representation of Block Codes 


An (n, K) block code consists of n-bit vectors, each vector corresponding to a unique 
block of k « n message bits. Since there are 2* different k-bit message blocks and 
2" possible n-bit vectors, the fundamental strategy of block coding 1s to choose the 
2* code vectors such that the minimum distance is as large as possible. But the code 
should also have some structure that facilitates the encoding and decoding 
processes. We'll therefore focus on the class of systematic linear block codes. 

Let an arbitrary code vector be represented by 


X-(ux '" Xn) 


where the elements x,, X2, . . ., are, of course, binary digits. A code is linear if it 
includes the all-zero vector and if the sum of any two code vectors produces another 
vector in the code. The sum of two vectors, say X and Z, is defined as 


A 
X+Z = (x4, 87 xz diis x, Q z,) [1] 
in which the elements are combined according to the rules of mod-2 addition given 
in Eq. (2), Sect. 11.4. 
As a consequence of linearity, we can determine a code’s minimum distance by 
the following argument. Let the number of nonzero elements of a vector X be sym- 


bolized by w(X), called the vector weight. The Hamming distance between any two 
code vectors X and Z is then 


d(X,Z) = w(X + Z) 
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since x, Q z, = 1 if x, # z,, and so on. The distance between X and Z therefore 
equals the weight of another code vector X + Z. But if Z = (00 --- 0) then 
X + Z = X; hence, 

ig = [w(X) Jimin X * (0 oves 0) [2] 


In other words, the minimum distance of a linear block code equals the smallest 
nonzero vector weight. 


A systematic block code consists of vectors whose first k elements (or last k ele- 
ments) are identical to the message bits, the remaining n — k elements being check 
bits. A code vector then takes the form 


X-(m m,-- m, ec € ' €) [3a] 
where 
q=un-—k [3b] 
For convenience, we'll also write code vectors in the partitioned notation 
X-(M | C) 


in which M is a k-bit message vector and C is a q-bit check vector. Partitioned nota- 
tion lends itself to the matrix representation of block codes. 

Given a message vector M, the corresponding code vector X for a systematic 
linear (n, k) block code can be obtained by a matrix multiplication 


X = MG [4] 
The matrix G is a k X n generator matrix having the general structure 
Gt, | P] [5a] 


where I, is the k X k identity matrix and P is ak X q submatrix of binary digits rep- 
resented by 


Pu Pw `| Pig 
Pa Po `° Pag 

Pes : : [5b] 
Pa Po US Pa 


The identity matrix in G simply reproduces the message vector for the first k ele- 
ments of X, while the submatrix P generates the check vector via 


C= MP [6a] 


This binary matrix multiplication follows the usual rules with mod-2 addition 
instead of conventional addition. Hence, the jth element of C is computed using the 
jth column of P, and 


Cj = m py O mpy D: Om, py; [6b] 
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for j = 1, 2,..., q. All of these matrix operations are less formidable than they 
appear because every element equals either O or 1. 

The matrix representation of a block code provides a compact analytical vehicle 
and, moreover, leads to hardware implementations of the encoder and decoder. But 
it does not tell us how to pick the elements of the P submatrix to achieve specified 
code parameters such as dmin and R.. Consequently, good codes are discovered with 
the help of considerable inspiration and perspiration, guided by mathematical analy- 
sis. In fact, Hamming (1950) devised the first popular block codes several years 
before the underlying theory was formalized. 


Hamming Codes 
A Hamming code is an (n, k) linear block code with q = 3 check bits and 
Ete] k7n-—q [7a] 


The code rate is 


k 
Ree = 1 q 
n 21— 1 





[7b] 


and thus R, = 1 if q >> 1. Independent of q, the minimum distance is fixed at 
dy, = 3 [7d 


so a Hamming code can be used for single-exror correction or double-error detec- 
tion. To construct a systematic Hamming code, you simply let the k rows of the P 
submatrix consist of all g-bit words with two or more 1s, arranged in any order. 

For example, consider a systematic Hamming code with q = 3, so n = 2? — 
1=7andk = 7 — 3 = 4. According to the previously stated rule, an appropriate 
generator matrix is 


ooo} 
DO- O = -O 
n2 e E E) 
—€— O OC Q 
=. =e Ln 
=en © 
=. O me | 


0 0 


The last three columns constitute the P submatrix whose rows include all 
3-bit words that have two or more 1s. Given a block of message bits M — 
(m, m, m, ma), the check bits are determined from the set of equations 


cy = m, Qm, 0 m,00 
0 O m, OQ m,Q m, 
c4,^7mj9m,G089m, 


C2 
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These check-bit equations are obtained by substituting the elements of P into Eq. (6). 

Figure 13.2-1 depicts an encoder that carries out the check-bit calculations for 
this (7, 4) Hamming code. Each block of message bits going to the transmitter is 
also loaded into a message register. The cells of the message register are connected 
to exclusive-OR gates whose outputs equal the check bits. The check bits are stored 
in another register and shifted out to the transmitter after the message bits. An input 
buffer holds the next block of message bits while the check bits are shifted out. The 
cycle then repeats with the next block of message bits. 

Table 13.2-1 lists the resulting 2* = 16 codewords and their weights. The 
smallest nonzero weight equals 3, confirming that d, = 3. 
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Figure 13.2-1 Encoder for [7, 4) Hamming code. 


Table 13.2-1 Codewords for the (7, 4) Hamming code 
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Consider a systematic (6, 3) block code generated by the submatrix 
1 10 

P=);011 
101 


Write the check-bit equations and tabulate the codewords and their weights to show 
that dnin = 3. 





Syndrome Decoding 


Now let Y stand for the received vector when a particular code vector X has been 
transmitted. Any transmission errors will result in Y # X. The decoder detects or 
corrects errors in Y using stored information about the code. 

A direct way of performing error detection would be to compare Y with every 
vector in the code. This method requires storing all 2* code vectors at the receiver 
and performing up to 2* comparisons. But efficient codes generally have large val- 
ues of k, which implies rather extensive and expensive decoding hardware. As an 
example, you need q = 5 to get R, = 0.8 with a Hamming code; then n = 31, k = 
26, and the receiver must store a total of n X 2* > 10? bits! 

More practical decoding methods for codes with large k involve parity-check 
information derived from the code's P submatrix. Associated with any systematic 
linear (n, k) block code is a q X n matrix H called the parity-check matrix. This 


matrix is defined by 
HTS B [8] 
I 


where HT denotes the transpose of H and I, is the q X q identity matrix. Relative to 
error detection, the parity-check matrix has the crucial property 


XH'-(0 0 - 0) [9] 


provided that X belongs to the set of code vectors. However, when Y is not a code 
vector, the product YHT contains at least one nonzero element. 
Therefore, given HT and a received vector Y, error detection can be based on 


S = YHT [10] 


a q-bit vector called the syndrome. If all elements of S equal zero, then either Y 
equals the transmitted vector X and there are no transmission errors, or Y equals 
some other code vector and the transmission errors are undetectable. Otherwise, 
errors are indicated by the presence of nonzero elements in S. Thus, a decoder for 
error detection simply takes the form of a syndrome calculator. A comparison of 


Eqs. (10) and (6) shows that the hardware needed is essentially the same as the 
encoding circuit. 


4 
d 
1 
d 
i 











13.2 Linear Block Codes 


Error correction necessarily entails more circuitry but it, too, can be based on 
the syndrome. We develop the decoding method by introducing an n-bit error vec- 
tor E whose nonzero elements mark the positions of transmission errors in Y. For 
instance, if X = (10110) and Y= (100 1 1) then E = (0010 1). In general, 


Y=X+E [lla] 


and, conversely, 
X=Yt+E [115] 


since a second error in the same bit location would cancel the original error. Substi- 
tuting Y = X + E into S = YH’ and invoking Eq. (9), we obtain 


S = (X + EH = XH! + EH! = EH" [12] 


which reveals that the syndrome depends entirely on the error pattern, not the spe- 
cific transmitted vector. 

However, there are only 24 different syndromes generated by the 2" possible 
n-bit error vectors, including the no-error case. Consequently, a given syndrome does 
not uniquely determine E. Or, putting this another way, we can correct just 27 — 1 
patterns with one or more errors, and the remaining patterns are uncorrectable. We 
should therefore design the decoder to correct the 24 — 1 most likely error patterns— 
namely those patterns with the fewest errors, since single errors are more probable 
than double errors, and so forth. This strategy, known as maximum-likelihood 
decoding, is optimum in the sense that it minimizes the word-error probability. 
Maximum-likelihood decoding corresponds to choosing the code vector that has 
the smallest Hamming distance from the received vector. 

To carry out maximum-likelihood decoding, you must first compute the syn- 
dromes generated by the 2? — 1 most probable error vectors. The table-lookup 
decoder diagrammed in Fig. 13.2-2 then operates as follows. The decoder calcu- 
lates S from the received vector Y and looks up the assumed error vector E stored in 
the table. The sum Y + E generated by exclusive-OR gates finally constitutes the 
decoded word. If there are no errors, or if the errors are uncorrectable, then 
S = (00-0) so Y + E = Y. The check bits in the last q elements of Y + E may 
be omitted if they are of no further interest. 

The relationship between syndromes and error patterns also sheds some light 
on the design of error-correcting codes, since each of the 24 — 1 nonzero syndromes 
must represent a specific error pattern. Now there are (1) — n single-error patterns 
for an n-bit word, (7) double-error patterns, and so forth. Hence, if a code is to cor- 
rect up to ż errors per word, q and n must satisfy 


s-iese()e e a) [13] 
2 t 
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Figure 13.22 Tablelookup decoder. 


In the particular case of a single-error-correcting code, Eq. (13) reduces to 
27 — ] z n. Furthermore, when E corresponds to a single error in the jth bit of a 
codeword, we find from Eq. (12) that S is identical to the jth row of HT. Therefore, 
to provide a distinct syndrome for each single-error pattern and for the no-error pat- 
tern, the rows of HT (or columns of H) must all be different and each must contain 
at least one nonzero element. The generator matrix of a Hamming code is designed 
to satisfy this requirement on H, while q and n satisfy 24 — 1 = n. 





EXAMPLE 13.2-2 Let's apply table-lookup decoding to a (7, 4) Hamming code used for single-error 
correction. From Eq. (8) and the P submatrix given in Example 13.2-1, we obtain 
the 3 X 7 parity-check matrix 


MES SES NEC || EEG 
H-[P"| LJ=|0 1 1 1/0 10 
1101/00 I 


There are 2? — 1 — 7 correctable single-error patterns, and the corresponding syn- 
dromes listed in Table 13.2-2 follow directly from the columns of H. To accommo- 
date this table the decoder needs to store only (q + n) X 2? = 80 bits. 

But suppose a received word happens to have two errors, such that E = 
(1000010). The decoder calculates S = YH’ = EH" = (111) and the syn- 
drome table gives the assumed single-error pattern E — (0100000). The 
decoded output word Y + E therefore contains three errors, the two transmission 
errors plus the erroneous correction added by the decoder. 

If multiple transmission errors per word are sufficiently infrequent, we need not 
be concerned about the occasional extra errors committed by the decoder. If multi- 
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Table 13.2-2 Syndromes for the (7, 4) Hamming code 
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ple errors are frequent, a more powerful code would be required. For instance, an 
extended Hamming code has an additional check bit that provides double-error 
detection along with single-error correction. 





Use Eqs. (8) and (10) to show that the jth bit of Sis given by EXERCISE 13.2-2 


5j = WiPy e Y2P2 OD YkPkj e Yk+j 


Then diagram the syndrome-calculation circuit for a (7, 4) Hamming code, and 
compare it with Fig. 13.2-1. 





Cyclic Codes 


The code for a forward-error-correction system must be capable of correcting t = 1 
errors per word. It should also have a reasonably efficient code rate R, = k/n. These 
two parameters are related by the inequality 


1-R =t) S (")| [14] 
n i=0 I 


which follows from Eq. (13) with g = n — k = n(1 — R,). This inequality under- 
Scores the fact that if we want R, = 1, we must use codewords with n > 1 and 
k >> 1. However, the hardware requirements for encoding and decoding long code- 
words may be prohibitive unless we impose further structural conditions on the 
code. Cyclic codes are a subclass of linear block codes with a cyclic structure that 
leads to more practical implementation. Thus, block codes used in FEC systems are 
almost always cyclic codes. 

To describe a cyclic code, we'll find it helpful to change our indexing scheme 
and express an arbitrary n-bit code vector in the form 
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X = (Xn-1 Xn-2 ` Ly Xo) [15] 


Now suppose that X has been loaded into a shift register with feedback connection 
from the first to last stage. Shifting all bits one position to the left yields the cyclic 
shift of X, written as 


pls 
AU cop aed oe Xi Xo Maur) [16] 


A second shift produces X" = (x,-3 © xi xg x, x,-5), and so forth. A 
linear code 1s cyclic if every cyclic shift of a code vector X is another vector in the 
code. 


This cyclic property can be treated mathematically by associating a code vector 
X with the polynomial 


X(p) = xp- P + xpp"? ese pay [17] 


where p is an arbitrary real variable. The powers of p denote the positions of the 
codeword bits represented by the corresponding coefficients of p. Formally, binary 
code polynomials are defined in conjunction with Galois fields, a branch of modern 
algebra that provides the theory needed for a complete treatment of cyclic codes. For 
our informal overview of cyclic codes we’ll manipulate code polynomials using 
ordinary algebra modified in two respects. First, to be in agreement with our earlier 
definition for the sum of two code vectors, the sum of two polynomials is obtained 
by mod-2 addition of their respective coefficients. Second, since all coefficients are 
either 0 or 1, and since 1 ® 1 = O0, the subtraction operation is the same as mod-2 
addition. Consequently, if X(p) + Z(p) = 0 then X(p) = Z(p). 
We develop the polynomial interpretation of cyclic shifting by comparing 


pX(p) = X, p'-F x, op" X «Exp + xop 
with the shifted polynomial 
Xp) c ee ane soe a 

If we sum these polynomials, noting that (x;  xj)p? = 0, and so on, we get 

pX(p) + X'(p) = x, p" + xa 
and hence 

X'(p) = pX(p) + xy-i(p" + 1) 118] 
Iteration yields similar expressions for multiple shifts. 


The polynomial p” + 1 and its factors play major roles in cyclic codes. Specif- 
ically, an (n, k) cyclic code is defined by a generator polynomial of the form 


Cp) = pt guip + > +gptl [19] 


where q = n — k and the coefficients are such that G(p) is a factor of p" + 1. Each 
codeword then corresponds to the polynomial product 


X(p) = Ou(p)G(p) [20] 
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in which Q y( p) represents a block of k message bits. All such codewords satisfy the 
cyclic condition in Eq. (18) since G(p) is a factor of both X( p) and p^ + 1. 

Any factor of p" + 1 that has degree q may serve as the generator polynomial 
for a cyclic code, but it does not necessarily generate a good code. Table 13.2-3 lists 
the generator polynomials of selected cyclic codes that have been demonstrated to 
possess desirable parameters for FEC systems. The table includes some cyclic Ham- 
ming codes, the famous Golay code, and a few members of the important family of 
BCH codes discovered by Bose, Chaudhuri, and Hocquenghem. The entries under 
G(p) denote the polynomial’s coefficients; thus, for instance, 1 0 1 1 means that 
Gp)=p?+O0+ptl. 

Cyclic codes may be systematic or nonsystematic, depending on the term 
Qu p) in Eq. (20). For a systematic code, we define the message-bit and check-bit 
polynomials 


M(p) = my pF! + + + mp + mg 
C(p) 7 pt too eee ay 


and we want the codeword polynomials to be 


X(p) = p*M(p) + C(p) [21] 
Equations (20) and (21) therefore require p*M( p) + C(p) = Qy(p)G(p), or 
p*M(p) C(p) 
= 4+ 22a 
Gg) ap) s 


This expression says that C(p) equals the remainder left over after dividing p*M( p) 
by G(p), just as 14 divided by 3 leaves a remainder of 2 since 14/3 = 4 + 2/3. 
Symbolically, we write 




















p*M(p) 
C(p) = rem C [22b] 
G(p) 
where rem [ ] stands for the remainder of the division within the brackets. 
Table 13.2-3 Selected cyclic codes 
Type n k R, oin G(p) 
Hamming 7 4 0.57 3 1 Oll 
Codes 15 11 0.73 3 10 Oll 
31 26 0.84 3 100 101 
BCH 15 7 0.46 5 111 010 001 
Codes 31 21 0.68 5 11 101 101 001 
63 45 0.71 7 1 111 000 001 O11 001 111 
Golay 23 12 0.52 7 101 011 100 011 


Code 
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EXAMPLE 13.2-3 Consider the cyclic (7, 4) Hamming code generated by G(p) = p? - 0 4 p ^ 1. 
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The division operation needed to generate a systematic cyclic code is easily and 
efficiently performed by the shift-register encoder diagrammed in Fig. 13.2-3. 
Encoding starts with the feedback switch closed, the output switch in the message- 
bit position, and the register initialized to the all-zero state. The k message bits are 
shifted into the register and simultaneously delivered to the transmitter. After k shift 
cycles, the register contains the q check bits. The feedback switch is now opened 
and the output switch is moved to deliver the check bits to the transmitter. 


Syndrome calculation at the receiver is equally simple. Given a received vector 
Y, the syndrome is determined from 


Y(p) | 
S(p) = rem Es [23] 
If Y(p) is a valid code polynomial, then G(p) will be a factor of Y( p) and Y(p)/G(p) 
has zero remainder. Otherwise we get a nonzero syndrome polynomial indicating 
detected errors. 

Besides simplified encoding and syndrome calculation, cyclic codes have other 
advantages over noncyclic block codes. The foremost advantage comes from the 
ingenious error-correcting decoding methods that have been devised for specific 
cyclic codes. These methods eliminate the storage needed for table-lookup decoding 
and thus make it practical to use powerful and efficient codes with n => 1. Another 


advantage is the ability of cyclic codes to detect error bursts that span many succes- 
sive bits. 


We'll use long division to calculate the check-bit polynomial C(p) when M = (1100). 
We first write the message-bit polynomial 


M(p) =p? +p?+0+0 


Feedback 


Input 
E SL —L— 





rc To transmitter 
o 


Message bits 


Figure 13.2-3 Shift-register encoder. 
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so p*M(p) = p Mlp) = p$ + p? - 0-0 -- 0-- 0 +0. Next, we divide G(p) 
into p?M(p), keeping in mind that subtraction is the same as addition in mod-2 
arithmetic. Thus, 








p-0c-p-lipvp-0-0-0-0-20 
p 0+ pep 
p+ p*+ p?+0 
ptoO+p?+ p 





p'+0 + p?+0 

p’'+0 + p*+p 
0+ 0+p+0 
0+0+0+0 
C(p)=0+p+0 


so the complete code polynomial is 
X(p) = p?M(p) + C(p) = p+ p?+0+04+0+p+0 
which corresponds to the codeword 
X=(1 100 ] 01 O) 


You'll find this codeword back in Table 13.2—1, where you'll also find the cyclic 
shift X’ = (1000 |101) and all multiple shifts. 

Finally, Fig. 13.2-4 shows the shift-register encoder and the register bits for 
each cycle of the encoding process when the input is M = (1 1 0 0). After four shift 
cycles, the register holds C = (0 1 0)—in agreement with our manual division. 

Remember that an error is only detected when Y( p) is not evenly divisible by 
G(p). If G(p) is not properly chosen, errors could occur in Y(p) making it evenly 
divisible by G( p) and, therefore, some errors could go undetected. A class of cyclic 
codes we call the cyclic redundancy codes (CRCs) have been designed to minimize 
the possibility of errors slipping through, particularly for burst-error detection. 
CRCS are also characterized by any end-around-cyclic shift of a codeword that pro- 
duces another codeword. Their structure makes for efficient coding and decoding. 
Some of the CRCs are given in Table 13.2-4. 

With CRCs, the following error types will be detected: (a) all single bit errors; 
(b) any odd number of errors, assuming p + 1 is a factor of G(p); (c) burst errors of 
length not exceeding q, where q is the number of check bits; (d) double errors if 
G(p) contains at least three 1s. 

If all patterns of error bursts are equally likely, then for bursts of length q + 1 
the probability of the error being undetected is 1/24^!, and if the burst length is 
greater than q + 1, then the probability of it going undetected is 1/29. See Peterson 


and Brown (1961) for a more extensive discussion on the error detection capabilities 
of CRCs. 
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Input Register bits Register bits after shifts 
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Figure 13.2-4 {a} Shiftregister encoder for (7, 4} Hamming code; {b} register bits when 
M=(1100). 


Table 13.2-4 Cyclic redundancy codes 
Code G(p) q7^n-k 


CRC-8 100 000 111 

CRC-12 1 100 000 001 111 12 
CRC-16 11 000 000 000 000 101 16 
CRC-CCIT 10 001 000 000 100 001 16 


We want to transmit an ASCII letter “J,” which in binary is 1001010, and then be 
able to check for errors using CRC-8 code. Determine the transmitted sequence X 
and then show how the receiver can detect errors in the two left-most message bits 





EXERCISE 13.2-3 
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M-ary Codes 


A subset of the BCH codes that perform well under burst error conditions and can be 
used with M-ary modulation systems are the Reed-Solomon (RS) codes. These are 
nonbinary codes that are members of an M-ary alphabet. If we use an m-bit digital 
encoder we will then have an alphabet of M — 2" symbols. An RS code's minimum 


distance is 
da,7n-k-cl [24] 


and, as with the binary codes, n is the total number of symbols in the code block-and 


k is the number of message symbols. RS codes are capable of correcting t or fewer 
symbol errors with 





RSEN Tea [25] 


i With an M symbol alphabet, we haven = 2" — ] and thus k = 2" — 1 — 2t. 





F Consider a binary BCH (n, k) = (7, 4) system. From this d, = 3 bits. Let's say we 

have an RS (n, k) = (7,4) system where each symbol has m = 3 bits. Thus 

[ dain = n — k = 7 — 4 = 3 symbols. However, in terms of bits, the RS code gives us 

i a distance of (2°)? = 512 bits, whereas the binary BCH code only gave us a 3-bit dis- 

tance. Thus an RS system has the potential for large code distances. In general an M- 

E ary (n, k) RS code with m bits per symbol has (2")" code vectors out of which (2”)* 
= are message vectors. With more symbol bits, the greater the possible code distance. 


[ How many symbols in error can a (63, 15) RS code correct? EXERCISE 13.2-4 


EXAMPLE 13.2-4 








E 13.3 CONVOLUTIONAL CODES 


Convolutional codes have a structure that effectively extends over the entire trans- 

mitted bit stream, rather than being limited to codeword blocks. The convolutional 
| structure is especially well suited to space and satellite communication systems that 
2 require simple encoders and achieve high performance by sophisticated decoding 

methods. Our treatment of this important family of codes consists of selected exam- 
[ ples that introduce the salient features of convolutional encoding and decoding. 


Convolutional Encoding 


The fundamental hardware unit for convolutional encoding 1s a tapped shift register 
with L + 1 stages, as diagrammed in Fig. 13.3-1. Each tap gain g is a binary digit 
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Figure 13.3-1 Tapped shift register for convolutional encoding. 


representing a short-circuit connection or an open circuit. The message bits in the 
register are combined by mod-2 addition to form the encoded bit 


x 


j = mj-,gj€ v €m,.,g,mjgo [1] 


L 
= by mi i (mod-2) 
7=0 


The name convolutional encoding comes from the fact that Eq. (1) has the form of a 
binary convolution, analogous to the convolutional integral 


x(t) = | mc — A)g(A) da , 


Notice in Eq. (1) that x; depends on the current input m, and on the state of the regis- : 
ter defined by the previous L message bits. Also notice that a particular message bit 
influences a span of L + 1 successive encoded bits as it shifts through the register. 

To provide the extra bits needed for error control, a complete convolutional 
encoder must generate output bits at a rate greater than the message bit rate r,. This 
is achieved by connecting two or more mod-2 summers to the register and inter- 
leaving the encoded bits via a commutator switch. For example, the encoder in 
Fig. 13.3-2 generates n = 2 encoded bits 


xj—mjj,Qm,,Qm, x; = mm; [2] 
which are interleaved by the switch to produce the output stream 
X = xxi xxx 


The output bit rate is therefore 2r, and the code rate is R, = 1/2—like an (n, k) 
block code with R, = k/n = 1/2. 
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Figure 13.3-2 Convolutional encoder with n = 2, k= 1, and L= 2. 





However, unlike a block code, the input bits have not been grouped into words. 
Instead, each message bit influences a span of n(L + 1) = 6 successive output bits. 
The quantity n(L +1) is called the constraint length measured in terms of 
encoded output bits, whereas L is the encoder's memory measured in terms of input 
message bits, We say that this encoder produces an (n, k, L) convolutional code+ 
with n = 2, k = l, and L = 2. 

Three different but related graphical representations have been devised for the 
study of convolutional encoding: the code tree, the code trellis, and the state dia- 
gram. We'll present each of these for our (2, 1, 2) encoder in Fig. 13.32, starting 
with the code tree. In accordance with normal operating procedure, we presume that 
the register has been cleared to contain all Os when the first message bit m, arrives. 
Hence, the initial state is m..,mg = 00 and Eq. (2) gives the output xix = 00 if 
m, = Qor xixi = ll ifm, =l. 

The code tree drawn in Fig. 13.3-3 begins at a branch point or node labeled a 
representing the initial state. If m, = 0, you take the upper branch from node a to 
find the output 00 and the next state, which is also labeled a since mg m, = 00 in this 
case. If m, = 1, you take the lower branch from a to find the output 11 and the next 
state mom, = 01 signified by the label b. The code tree progressively evolves in this 
fashion for each new input bit. Nodes are labeled with letters denoting the current 
state m; 4 m;..,; you go up or down from a node, depending on the value of m; each 
branch shows the resulting encoded output x; x} calculated from Eq. (2), and it ter- 
minates at another node labeled with the next state. There are 2’ possible branches 


for the jth message bit, but the branch pattern begins to repeat at j = 3 since the reg- 
ister length is L + 1 = 3. 


tNotation for convolutional codes has not been standardized and varies from author to author, as 
does the definition of constraint length. 
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Figure 13.3-3 Code tree for (2, 1, 2) encoder. 


Having observed repetition in the code tree, we can construct a more compact 
picture called the code trellis and shown in Fig. 13.3-4a. Here, the nodes on the left 
denote the four possible current states, while those on the right are the resulting next 
states. A solid line represents the state transition or branch for m; = 0, and a broken 
line represents the branch for m; = 1. Each branch is labeled with the resulting out- 
put bits x;x7. Going one step further, we coalesce the left and right sides of the trel- 
lis to obtain the state diagram in Fig. 13.3-4b. The self-loops at nodes a and d rep- 
resent the state transitions a-a and d-d. 

Given a sequence of message bits and the initial state, you can use either the 
code trellis or state diagram to find the resulting state sequence and output bits. The 
procedure is illustrated in Fig. 13.3—4c, starting at initial state a. 

Numerous other convolutional codes are obtained by modifying the encoder in 
Fig. 13.3-2. If we just change the connections to the mod-2 summers, then the code 
tree, trellis, and state diagram retain the same structure since the states and branch- 
ing pattern reflect only the register contents. The output bits would be different, of 
course, since they depend specifically on the summer connections. 
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Figure 13.3-4 (a) Code trellis; (b) state diagram for (2, 1, 2) encoder; (c) illustrative sequence. 


If we extend the shift register to an arbitrary length L + 1 and connect itto n = 
2 mod-2 summers, we get an (n, k, L) convolutional code with k = 1 and code rate 
R, = 1/n = 1/2. The state of the encoder is defined by L previous input bits, so the 
code trellis and state diagram have 2 different states, and the code-tree pattern 
repeats after j = L + 1 branches. Connecting one commutator terminal directly to 
the first stage of the register yields the encoded bit stream 


X = my xix my xx maxsxts [3] 


which defines a systematic convolutional code with R, — 1/n. 

Code rates higher than 1/n require k = 2 shift registers and an input distributor 
Switch. This scheme is illustrated by the (3, 2, 1) encoder in Fig. 13.3-5. The mes- 
sage bits are distributed alternately between k = 2 registers, each of length L + 1 = 
2. We regard the pair of bits m;..,m; as the current input, while the pair m;_3 m; 
constitute the state of the encoder. For each input pair, the mod-2 summers generate 
n — 3 encoded output bits given by 


Pn Hol 
x; = mj- D mj. m; xj = mj. mj., Dm; (4] 


m 


xj = mjm; 
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Input rate r, 





Output rate 





Figure 13.3-5 (3, 2, 1) encoder. 


Thus, the output bit rate is 3r,/2, corresponding to the code rate R, = k/n = 2/3. 
The constraint length is n(L + 1) = 6, since a particular input bit influences a span 
of n = 3 output bits from each of its L + 1 = 2 register positions. 

Graphical representation becomes more cumbersome for convolutional codes 
with k > 1 because we must deal with input bits in groups of 2*. Consequently, 2* 
branches emanate and terminate at each node, and there are 2“ different states. 
As an example, Fig. 13.3-6 shows the state diagram for the (3, 2, 1) encoder in 
Fig. 13.3—5. The branches are labeled with the k = 2 input bits followed by the 
resulting n — 3 output bits. 

The convolutional codes employed for FEC systems usually have small values 
of n and k, while the constraint length typically falls in the range of 10 to 30. All 


convolutional encoders require a commutator switch at the output, as shown in 
Figs. 13.3-2 and 13.3-5. 








In any case, convolutional encoding hardware is simpler than the hardware for block 


encoding since message bits enter the register unit at a steady rate r, and an input 
buffer is not needed. 
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Figure 13.3-6 State diagram for (3, 2, 1) encoder. 


The (2, 1, 2) convolutional encoder of Fig. 13.3-2 can be expressed as two genera- 
tor polynomials with G,(D) = 1 + D! + D? and G,(D) = 1 + D?. If we trans- 
form the message sequence into polynomial form, the message sequence of (1 1 0 
11100100 0) is thus described as M(D) = 1 + D + D? + D4 + D + D*. 
This transformation of the message and register gains into the D domain is analo- 
gous to the Fourier transform converting a convolution into a multiplication. The 
output from the upper branch of our encoder becomes X; = M(D)G,(D) = 
(l+D+D>+D*+D°+D®\(i+ D+ D?) = (1+.D°+pD7+ D+ D° + D”) 
and thus x; = 100001011110. Note when doing the multiplication operation, 
D' + Dİ = 0. Similarly for the lower branch we have X; = M(D)G,(D) = 
(1-D-D^-D*« D« D®)(1+ D) -(1- DD? D*4 D D' - D* +D") 
and hence x; = 111010111010. With interleaving, the output becomes x; = 


11 01 01 0001 1001 11 11 10 11 00, which is the same result as we obtained using 
convolution. 





EXAMPLE 13.3-1 





Consider a systematic (3, 1, 3) convolutional code. List the possible states and deter- EXERCISE 13.3-1 
mine the state transitions produced by m; — 0 and m; — 1. Then construct and label 

the state diagram, taking the encoded output bits to be mj, m; .,&9 mj; and 

mj- D mj. (See Fig. P13.3-4 for a convenient eight-state pattern.) 
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Free Distance and Coding Gain 


We previously found that the error-control power of a block code depends upon its 
minimum distance, determined from the weights of the codewords. A convolutional 
code does not subdivide into codewords, so we consider instead the weight w(X) of 
an entire transmitted sequence X generated by some message sequence. 
The free distance of a convolutional code is then defined to be 

dj (WOO Las X € 000-- [5] 
The value of d; serves as a measure of error-control power. 

It would be an exceedingly dull and tiresome task to try to evaluate d, by listing 
all possible transmitted sequences. Fortunately, there's a better way based on the 
normal operating procedure of appending a "tail" of Os at the end of a message to 
clear the register unit and return the encoder to its initial state. This procedure elim- 
inates certain branches from the code trellis for the last L transitions. 

Take the code trellis in Fig. 13.3-4a, for example. To end up at state a, the next- 
to-last state must be either a or c so the last few branches of any transmitted 
sequence X must follow one of the paths shown in Fig. 13.3—7. Here the final state is 
denoted by e, and each branch has been labeled with the number of 1s in the 
encoded bits—which equals the weight associated with that branch. The total weight 
of a transmitted sequence X equals the sum of the branch weights along.tbe path of 
X. In accordance with Eq. (5), we seek the path that has the smallest branch-weight 
sum, other than the trivial all-zero path. 

Looking backwards L + 1 = 3 branches from e, we locate the last path that 
emanates from state a before terminating at e. Now suppose all earlier transitions 
followed the all-zero path along the top line, giving the state sequence aa--- abce. 
Since an a-a branch has weight 0, this state sequence corresponds to a minimum- 
weight nontrivial path. We therefore conclude that d; = 0 - 0 - = +0442 + 
1 +2 = 5. There are other minimum-weight paths, such as aa:::abcae and 
aa **: abcbce, but no nontrivial path has less weight than d; = 5. 

Another approach to the calculation of free distance involves the generating 
function of a convolutional code. The generating function may be viewed as the 





Figure 13.3-7 Termination of (2, 1, 2) code trellis. 
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transfer function of the encoder with respect to state transitions. Thus, instead of 
relating the input and output bits streams by convolution, the generating function 
relates the initial and final states by multiplication. Generating functions provide 
important information about code performance, including the free distance and 
decoding error probability. 

We'll develop the generating function for our (2, 1, 2) encoder using the modi- 
fied state diagram in Fig. 13.3-8a. This diagram has been derived from Fig. 13.3-4b 
with four modifications. First, we've eliminated the a-a loop which contributes 
nothing to the weight of a sequence X. Second, we've drawn the c-a branch as the 
final c-e transition. Third, we've assigned a state variable W, at node a, and like- 
wise at all other nodes. Fourth, we've labeled each branch with two “gain” variables 
D and I such that the exponent of D equals the branch weight (as in Fig. 13.3-7), 
while the exponent of J equals the corresponding number of nonzero message bits 
(as signified by the solid or dashed branch line). For instance, since the c-e branch 
represents xix” = 11 and m; = 0, it is labeled with D?I? = D*. This exponential 
trick allows us to perform sums by multiplying the D and J terms, which will become 
the independent variables of the generating function. 






P4 
W, DI W W, = TD, DW, 
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Figure 13.3-8 (a) Modified state diagram for (2, 1, 2) encoder; (b equivalent block diagram. 
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Our modified state diagrain now looks like a signal-flow graph of the type 
sometimes used to analyze feédback systems. Specifically, if we treat the nodes as 
summing junctions and the DI terms as branch gains, then Fig. 13.3-8a represents 
the set of algebraic state equations 


W, = D?IW, + IW, — W, 2 DW, -- DW, - [6d]: 
W, = DIW, + DIW; | W, = DW, 


The encoder's generating function T(D, I) can now be defined by the input-output 
equation 


T(D, 1) Ê W/W, [6b] 


These equations are also equivalent to the block diagram in Fig. 13.3—8b, which fur- 
ther empbasizes the relationships between the state variables, the branch gains, and 
the generating function. Note that minus signs have been introduced here so that the 
two feedback paths c-b and d-d correspond to negative feedback. 

Next, the expression for T(D, J) is obtained by algebraic solution of Eq. (6), or 
by block-diagram reduction of Fig. 13.3—85 using the transfer-function relations for 
parallel, cascade, and feedback connections in Fig. 3.1-8. (If you know Mason's 


rule, you could also apply it to Fig. 13.3-8a.) Any of these methods produces the 
final result 





D?I 
T(D, I) = Ta 2DI [7a] 
= D'I + 2D8P? + AD'P + +: 
= X, rcp [7b] 
d=5 


where we've expanded (1 — 2DI) ! to get the series in Eq. (7b). Keeping in mind 
that 7(D, D represents all possible transmitted sequences that terminate with a c-e 
transition, Eq. (7b) has the following interpretation: for any d = 5, there are exactly 
24^? valid paths with weight w(X) = d that terminate with a c-e transition, and 
those paths are generated by messages containing d — 4 nonzero bits. The smallest 
value of w(X) is the free distance, so we again conclude that d, — 5. 

As a generalization of Eq. (7), the generating function for an arbitrary convolu- 
tional code takes the form 


co oo 
T(D,I) = X, SAG, i)D?T: [8] 

d=d, i=l 
Here, A(d, i) denotes the number of different input-output paths through the modi- 
fied state diagram that have weight d and are generated by messages containing i 

nonzero bits. 

Now consider a received sequence Y = X + E, where E represents transmission 
errors. The path of Y then diverges from the path of X and may or may not be a valid 
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path for the code in question. When Y does not correspond to a valid path, a maximum- 
likelihood decoder should seek out the valid path that has the smallest Hamming dis- 
tance from Y. Before describing how such a decoder might be implemented, we'll 
state the relationship between generating functions, free distance, and error probabil- 
ity in maximum-likelihood decoding of convolutional codes. 

If transmission errors occur with equal and independent probability œ per bit, 
then the probability of a decoded message-bit error 1s upper-bounded by 


< 1 ôT(D, I) 
"eT k dE. ene ccm 
When a is sufficiently small, series expansion of T(D, I) yields the approximation 


M(d 
B, © MD utn Va «1i [10] 


[9] 


where 


eo 


M(d,) = >) iA(dp i) 


i=l 
The quantity M(d,) simply equals the total number of nonzero message bits over all 
minimum-weight input-output paths in the modified state diagram. 

Equation (10) supports our earlier assertion that the error-control power of a 
convolutional code depends upon its free distance. For a performance comparison 
with uncoded transmission, we'll make the usual assumption of gaussian white 
noise and (S/N)4 = 2R,y, = 10 so Eq. (10), Sect. 13.1, gives the transmission error 
probability 

Qa = (AmR,y;) Pe RAs 
The decoded error probability then becomes 
M(d,)2% 
Se 
k(4aR y, 


whereas uncoded transmission would yield 


s —(R. d/2yy, [11] 


1 


P abe ^ > HC 
a (4r y)? 


E [12] 
Since the exponential terms dominate in these expressions, we see that convolu- 
tional coding improves reliability when R,dj/2 > 1. Accordingly, the quantity 
R.dg/2 is known as the coding gain, usually expressed in dB. 

Explicit design formulas for d, do not exist, unfortunately, so good convolutional 
codes must be discovered by computer search and simulation. Table 13.3-1 lists the 
maximum free distance and coding gain of convolutional codes for selected values of 
n, k, and L. Observe that the free distance and coding gain increase with increasing 
memory Z when the code rate R, is held fixed. All listed codes are nonsystematic; a 


583 


584 


CHAPTER 13. € Channel Coding and Encryption 


TN 
fax 





Table 13.3-1 Maximum free distance and coding gain of selected convolutional codes 
" k R. L 4, R.d,/2 
4 1 1/4 3 13 1.63 
3 1 1/3 3 10 1.68 

a 1 1/2 3 6. 1.50. 

6 10 2.50 

9 12 3.00 

2 2/3 3 2.33 

4 3 3/4 3 8 3.00 


systematic convolutional code has a smaller d; than an optimum nonsystematic code 
with the same rate and memory. 

We should also point out that some convolutional codes exhibit catastrophic 
error propagation. This occurs when a finite number of channel errors causes an 
infinite number of decoding errors, even if subsequent symbols are correct. 
Encoders that exhibit this behavior will show in their state diagram a state where a 


Output 





Figure 13.3-9 Encoder subject to catastrophic error propagation. (a) Encoder; {b} state 
diagram. 
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given nonzero input causes a transition back to that state itself producing a zero out- 
put. Catastrophic codes can also be identified if their generator polynomials have a 
common factor of degree at least one. For example, Fig. 13.3-9 shows a catastrophic 
encoder and its state diagram. Note at state d a nonzero input causes the encoder to 
branch back to itself with a zero output. The code is also found to be catastrophic 
because generator polynomials G,(D) = 1 + D and G,(D) = D + D? have the 
common factor of 1 + D. 








The (2, 1, 2) encoder back in Fig. 13.32 has T(D,I) = 
aT(D, I)/al = D?/(1 — 2DI)°. Equation (9) therefore gives 
2'[a(1— a) |? 
p, lal - 9) 
[L—- 4Va(1l — a)? 


and the small-o approximation agrees with Eq. (10). Specifically, in Fig. 13.3-8a 
we find just one minimum-weight nontrivial path abce, which has w(X) = 5 = d; 
and is generated by a message containing one nonzero bit, so M(d,) = 1. 

If y, = 10, then Ry, = 5, a = 8.5 X 10 ^, and maximum-likelihood decod- 
ing yields P,e ^ 6.7 X 107’, as compared with P pe ^ 4.1 X 107°. This rather 
small reliability improvement agrees with the small coding gain R, d;/2 = 5/4. 


A 25g5? 





Let the connections to the mod-2 summers in Fig. 13.3-2 be changed such that 
x; = m, and x} = m; Qm, , m, 

(a) Construct the code trellis and modified state diagram for this systematic 
code. Show that there are two minimum-weight paths in the state diagram, and that 
d, = 4 and M(d;) = 3. It is not necessary to find T(D, D. 

(b) Now assume y, = 10. Calculate a, P,,, and Puse What do you conclude 
about the performance of a convolutional code when R,d,/2 = 1? 


Decoding Methods 


There are three generic methods for decoding convolutional codes. At one extreme, 
the Viterbi algorithm executes maximum-likelihood decoding and achieves optimum 
performance but requires extensive hardware for computation and storage. At the 
other extreme, feedback decoding sacrifices performance in exchange for simplified 
hardware. Between these extremes, sequential decoding approaches optimum per- 


- formance to a degree that depends upon the decoder's complexity. We'll describe 


how these methods work with a (2, 1, L) code. The extension to other codes is con- 
ceptually straightforward, but becomes messy to portray for k > 1. 

Recall that a maximum-likelihood decoder must examine an entire received 
sequence Y and find a valid path that has the smallest Hamming distance from Y. 


DiI/(1 — 2DI), so 
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However, there are 2" possible paths for an arbitrary message sequence of N bits (or 
Nn/k bits in Y), so an exhaustive comparison of Y with all valid paths would be an 
absurd task in the usual case of N œ 1. The Viterbi algorithm applies maximum- 
likelihood principles to limit the comparison to 2* surviving paths, independent of 
N, thereby bringing maximum-likelihood decoding into the realm of feasibility. 

A Viterbi decoder assigns to each branch of each surviving path a metric that 
equals its Hamming distance from the corresponding branch of Y. (We assume here 
that Os and 1s have the same transmission-error probability; if not, the branch metric 
must be redefined to account for the differing probabilities.) Summing the branch 
metrics yields the path metric, and Y 1s finally decoded as the surviving path with the 
smallest metric. To illustrate the metric calculations and explain how surviving 
paths are selected, we'll walk through an example of Viterbi decoding. 

Suppose that our (2, 1, 2) encoder is used at the transmitter, and the received 
sequence starts with Y — 11 01 11. Figure 13.3-10 shows the first three branches of 
the valid paths emanating from the initial node ay in the code trellis. The number in 
parentheses beneath each branch is the branch metric, obtained by counting the differ- 
ences between the encoded bits and the corresponding bits in Y. The circled number at 
the right-hand end of each branch is the running path metric, obtained by summing 
branch metrics from dp. For instance, the metric of the path agb,c;b4 is 0 + 2 +2 = 4. 

Now observe that another path aga,a;b, also arrives at node b, and has a smaller 
metric 2 + 1 + 0 = 3. Regardless of what happens subsequently, this path will have a 
smaller Hamming distance from Y than the other path arriving at b, and is therefore 
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Figure 13.3-10 
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more likely to represent the actual transmitted sequence. Hence, we discard the larger- 
metric path, marked by an X, and we declare the path with the smaller metric to be the 
survivor at this node. Likewise, we discard the larger-metric paths arriving at nodes a5, 
c, and dù, leaving a total of 2” = 4 surviving paths. The fact that none of the surviv- 
ing path metrics equals zero indicates the presence of detectable errors in Y. 

Figure 13.3—11 depicts the continuation of Fig. 13.3—-10 for a complete message 
of N — 12 bits, including tail Os. All discarded branches and all labels except the 
running path metrics have been omitted for the sake of clarity. The letter T under a 
node indicates that the two arriving paths had equal running metrics, in which case 
we just flip a coin to choose the survivor (why?). The maximum-likelihood path fol- 
lows the heavy line from aj to a, and the final value of the path metric signifies at 
least two transmission errors in Y. The decoder assumes the corresponding transmit- 
ted sequence Y + E and message sequence M written below the trellis. 

A Viterbi decoder must calculate two metrics for each node and store 2" sur- 
viving paths, each consisting of N branches. Hence, decoding complexity increases 
exponentially with L and linearly with N. The exponential factor limits practical 
application of the Viterbi algorithm to codes with small values of L. 

When N >> 1, storage requirements can be reduced by a truncation process 
based on the following metric-divergence effect: if two surviving paths emanate 
from the same node at some point, then the running metric of the less likely path 
tends to increase more rapidly than the metric of the other survivor within about 5L 
branches from the common node. This effect appears several times in Fig. 13.3-11; 
consider, for instance, the two paths emanating from node b,. Hence, decoding need 
not be delayed until the end of the transmitted sequence. Instead, the first k message 
bits can be decoded and the first set of branches can be deleted from memory after 





Figure 13.3-11 Illustration of the Viterbi algorithm for maximum-likelihood decoding. 
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the first 5Ln received bits have been processed. Successive groups of k message bits 
are then decoded for each additional n bits received thereafter. 

Sequential decoding, which was invented before the Viterbi algorithm, also 
relies on the metric-divergence effect. A simplified version of the sequential algo- 
rithm is illustrated in Fig. 13.3-12a, using the same trellis, received sequence, and 
metrics as in Fig. 13.3-11. Starting at ag, the sequential decoder pursues a single 
path by taking the branch with the smallest branch metric at each successive node. If 
two or more branches from one node have the same metric, such as at node d,, the 
decoder selects one at random and continues on. Whenever the current path happens 
to be unlikely, the running metric rapidly increases and the decoder eventually 
decides to go back to a lower-metric node and try another path. There are three of 
these abandoned paths in our example. Even so, a comparison with Fig. 13.3-11 
shows that sequential decoding involves less computation than Viterbi decoding. 

The decision to backtrack and try again is based on the expected value of the 
running metric at a given node. Specifically, if a is the transmission error probabil- 
ity per bit, then the expected running metric at the jth node of the correct path equals 


Survivor 


Running metric 








Figure 13.3-12 Illustration of sequential decoding. 
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jna, the expected number of bit errors in Y at that point. The sequential decoder 
abandons a path when its metric exceeds some specified threshold A above jna. If 
no path survives the threshold test, the value of A is increased and the decoder back- 
tracks again. Figure 13.3-125 plots the running metrics versus j, along with jna and 
the threshold line jna + A for a = 1/16 and A = 2. 

Sequéntial decoding approaches the performance of maximum-likelihood 
decoding when the threshold is loose enough to permit exploration of all probable 
paths. However, the frequent backtracking requires more computations and results 
in a decoding delay significantly greater than Viterbi decoding. A tighter threshold 
reduces computations and decoding delay but may actually eliminate the most. 
probable path, thereby increasing the output error probability compared to that of 
maximum-likelihood decoding with the same coding gain. As compensation, 
sequential decoding permits practical application of convolutional codes with large L 
and large coding gain since the decoder’s complexity is essentially independent of L. 

We’ve described sequential decoding and Viterbi decoding in terms of algo- 
rithms rather than block diagrams of hardware. Indeed, these methods are usually 
implemented as software that performs the metric calculations and stores the path 
data. When circumstances preclude algorithmic decoding, and a higher error proba- 
bility is tolerable, feedback decoding may be the appropriate method. A feedback 
decoder acts in general like a “sliding block decoder" that decodes message bits one 
by one based on a block of L or more successive tree branches. We'll focus on the 
special class of feedback decoding that employs majority logic to achieve the sim- 
plest hardware realization of a convolutional decoder. 

Consider a message sequence M = m,m,--- and the systematic (2, 1, L) 
encoded sequence 


X cxx xS [13a] 
where 
L 
x; = m,; XL > mjigj (mod-2) [135] 
m 


We'll view the entire sequence X as a codeword of indefinite length. Then, borrow- 
ing from the matrix representation used for block codes, we'll define a generator 
matrix G and a parity-check matrix H such that 

X — MG XH? =00--- 


To represent Eq. (13), G must be a semi-infinite matrix with a diagonal structure 
given by 


|. D Bio. 0 z 
l g& 0g Qo See Og 
G= et Uno fe : [14a] 
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This matrix extends indefinitely to the right and down, and the triangular blank 
spaces denote elements that equal zero. The parity-check matrix is 


80 1 
8 0 g l 
& 0 g 1 
H=. i . . a. os [145] 
& 0 
g, 0 


which also extends indefinitely to the right and down. 


Next, let E be the transmission error pattern in a received sequence Y = X + E. 
We'll write these sequences as 


Y= yiytyiyh B= elefeses-- 
so that y; = m; € ej. Hence, given the error bit e;, the jth message bit is 
m; = yj De; [15] 
A feedback decoder estimates errors from the syndrome sequence 
= YH’ = (X + EH? = EHT 
Using Eq. (14b) for H, the jth bit of S is 


L L 
Sj = > yj-igi Dy = Meligeej [16] 
i=0 i=0 
where the sums are mod-2 and it's understood that yj; = ej-; = Oforj ~ i S 0. 


As a specific example, take a (2, 1, 6) encoder with gy = g; = g5 = gg = l and 
81 = 83 84 =Q, so 


5; = yj-6 DYj-s D yj-2 Dy; Dy} [17a] 
= eis eje; 5 eje; [175] 


Equation (17a) leads directly to the shift-register circuit for syndrome calculation 
diagrammed in Fig. 13.3~13. Equation (17D) is called a parity-check sum and will 
lead us eventually to the remaining portion of the feedback decoder. 

To that end, consider the parity-check table in Fig. 13.3-14a where checks indi- 
cate which error bits appear in the sums 5; 6, 5;-4, Sj-1, and sj. This table brings out 
the fact that e;..& is checked by all four of the listed sums, while no other error bit is 
checked by more than one. Accordingly, this set of check sums is said to be orthog- 


onal on ej. The tap gains of the encoder were carefully chosen to obtain orthogo- 
nal check sums. 
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Figure 13.3-13 Shift-register circuit for syndrome calculation for a systematic (2, 1, 6] code. 
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Figure 13.3-14 Parity-check table for a systematic (2, 1, 6) code. 


When the transmission error probability 1s reasonably small, we expect to find 
at most one or two errors in the 17 transmitted bits represented by the parity-check 
table. If one of the errors corresponds to ej- = 1, then the four check sums will 
contain three or four ls. Otherwise, the check sums contain less than three 1s. 
Hence, we can apply these four check sums to a majority-logic gate to generate the 
most likely estimate of e;_.. 

Figure 13.3-15 diagrams a complete majority-logic feedback decoder for our 
systematic (2, 1, 6) code. The syndrome calculator from Fig. 13.3-13 has two out- 
puts, yj- and s;. The syndrome bit goes into another shift register with taps that con- 
nect the check sums to tbe majority-logic gate, whose output equals the estimated 
error é;_¢. The mod-2 addition y;.& 25 & = mj. g carries out error correction based 
on Eq. (15). The error is also fed back to the syndrome register to improve the relia- 
bility of subsequent check sums. This feedback path accounts for the name feedback 
decoding. 

Our example decoder can correct any single-error or double-error pattern in six 
consecutive message bits. However, more than two transmission errors produces 
erroneous corrections and error propagation via the feedback path. These effects 
result in a higher output error than that of maximum-likelihood, decoding. See 
Lin and Costello (1983, Chap. 13) for the error analysis and further treatment of 
majority-logic decoding. 
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Figure 13.3-15 Majority-logic feedback decoder for a systematic (2, 1, 6) code. 


Turbo Codes 


Turbo codes, or parallel concatenated codes (PCC), are a relatively new class of 
convolutional codes first introduced in 1993. They have enabled channel capacities 
to nearly reach the Shannon limit. Our coverage of turbo codes will be primarily 
descriptive. Detailed treatments of turbo codes are given by Berrou et al. (1993), 
Berrou (1996), Hagenauer et al. (1996), and Johannesson and Zigangirov (1999). 

Shannon's theorem for channel capacity assumes random coding with the bit 
error rate (BER) approaching zero as the code's block or constraint length 
approaches infinity. It 1s therefore not feasible to decode a completely truly random 
code. Increases in code complexity accomplished by longer block or constraint 
lengths require a corresponding exponential increase in decoding complexity. We 
can, however, achieve a given BER if the code is sufficiently unstructured, and we 
are willing to pay the associated cost in decoding complexity. This leads us to the 
following paradox by J. Wolfowitz: 





Turbo codes overcome this paradox in that they can be made sufficiently random to 
achieve a given BER and, by using iterative methods, can be efficiently and feasibly 
decoded. 

Figure 13.3-16 illustrates a turbo encoder. Here we have the parallel concatena- 
tion of two codes produced from two rate 1/2 recursive systematic convolutional 
(RSC) encoders. The second RSC is preceded by a pseudo-random interleaver 
whose length can vary from 100-10,000 bits or more to permute the symbol 
sequence. The RSCs are not necessarily identical, nor do they have to be convolu- 
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Figure 13.3-16 Turbo encoder. 





Systematic bits 


Xk 








Parity-check bits 
Yk 


Figure 13.3-17 Recursive systematic convolutional [RSC) encoder with R = 1/2, G, = 23, 
G, = 35, and L = 2. 


tional, and more than two can be used. Both encoders produce the parity-check bits. 
These parity bits and the original bit stream (called the systematic bits) are multi- 
plexed and then transmitted. As given, the overall rate is 1/3. However, we may be 
able to increase this rate to R = 1/2 using a process of puncturing, whereby some 
bits are deleted. This could be done, for example, by eliminating the odd parity bits 
from the first RSC and the even parity bits from the second RSC. 

Figure 13.3~17 shows a G,, G;-RSC encoder that has been used for turbo coding. 
Unlike the nonsystematic convolutional (NSC) encoders described earlier in this sec- 
tion, this encoder is systematic in that the message and parity bits are separate. The 
feedback connections from the state outputs make this encoder recursive, and thus sin- 
gle output errors produce a large quantity of parity errors. For this particular encoder, 
the polynomial describing the feedback connections is 1 + D? + D* = 10 011, = 23, 
and the polynomial for the output is 1 + D + D? + D*= 11101, = 353. Hence, the 
literature often refers this to as a G, = 23, G; = 35, or simply a (23, 35) encoder. 

The turbo decoder is shown in Fig. 13.3-18. It consists of two maximum a pos- 
terior (MAP) decoders and a feedback path that works in similar manner to that of 
an automobile turbo engine, hence the term turbo code. The first decoder takes the 
information from the received signal and calculates the a posteriori probability 
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Figure 13.3-18 Turbo decoder. 


(APP) value. This value is then used as the a priori probability value for the second 
decoder. The output is then fed back to the first decoder where the process repeats in 
an iterative fashion with each iteration producing more refined estimates. 

Instead of using the Viterbi algorithm, which minimizes the error for the entire 
sequence, this MAP decoder uses a modified form of the BCJR (Bahl, Cocke, 
Jelinek, and Raviv, 1972) algorithm that takes into account the recursive character of 
the RSC codes and computes a log-likelihood ratio to estimate the APP for each bit. 

The results by Berrou et al. are impressive. When encoding using rate 
R = 1/2, G, = 37 and G, = 21, 65,537 interleaving, and 18 iterations, they were 
able to achieve a BER of 10 ? at E,/ Ny = 0.7 dB. The main disadvantage of turbo 
codes with their relatively large codewords and iterative decoding process is their 
long latency. À system with 65,537 interleaving and 18 iterations may have too long 


a latency for voice telephony. On the other hand, turbo codes have excellent per- 
formance in deep space applications. 





13.4 DATA ENCRYPTION 


Up to now, the purpose of channel coding was to reduce errors. Later, in Chapter 15, 
we will see how coding can be employed to spread out the information over the fre- 
quency band to increase a system’s immunity to jamming. However, in this section 
we want to look at encryption coding, in which the goal is secure communication. 
We will first consider the basics of encryption, and then look at two popular encryp- 
tion methods: the data encryption standard (DES) and Rivest-Shamir-Adleman 
(RSA) systems. 

Encryption is the transformation of plaintext into ciphertext; decryption is the 
inverse of this process. The cipher is the set of transformations, and the keys are the 
transformation parameters. There are two types of ciphers: Block encryption is 
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Figure 13.4-1 Secret key system. 
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Figure 13.4-2 Public/private key system. 


where fixed blocks of plaintext are operated on in a combinatorial fashion, and 
stream encryption is similar to convolution encoding in that small segments of the 
plaintext are processed. 

A single or secret key encryption system 1s shown in Fig. 13.4-1. Here the 
plaintext sequence x is transformed into a ciphertext sequence y as 


y = f(x, Esas) n 


where f is an encryption function consisting of a combination of substitutions, per- 
mutations, and/or mathematical operations. Parameter Kecer 18 a secret key that 
determines the coding and decoding parameters. The message recipient deciphers 
the message by applying an inverse function 


“= fx K secret) [2] 


With secret key encryption, the keys for the sender and recipient are identical 
and must be kept secret. Therefore, to ensure security, key distribution becomes a 
challenge. It must occur over a secure channel. That could be the postal system, 
another electronic communications channel, or perhaps carrier pigeon. The point is 
that the message is only as secure as the key. If an intruder is able to steal the key, the 
message is no longer secure. Periodically changing the key may also foil an intruder. 

Now let's consider the public-key system of Fig. 13.4-2 where the sender and 
recipient have a different key so 


y= F(x, k ouptic) [3] 
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and 


x= ae: K private) [4] 


Let's say Alice wants to send a secret message to Nate. Nate's public directory 
lists both his address and public key, k 4. Anytime someone wants to send a secret 
message to Nate, they encrypt it using his public key and the transform of Eq. (3). In 
this case, Alice sends ciphertext y to Nate. Nate then decrypts the message using the 
transform of Eq. (4) and his private key, k privates which is hidden from the world. The 
transforms in Eqs. (3) and (4) are designed such that the inverse of each function can 
only be computed if we know the other key. In other words, message x can only be 
extracted from sequence y if the recipient knows the private key; it is nearly impos- 
sible to extract x from y if we only know function f and the public key. 

We can also use this system for message authentication, when we want to send 
a message with an electronic signature. Let's say Nate wants to send Alice a signed 
message. Nate encrypts his signature using his private key and Eq. (4) and then 


sends the sequence x to Alice. Alice then uses Nate's public key and Eq. (3) to read. 


Nate's signature. In this scenario, Nate's signature can only be forged if the forger 
knows his private key. 
Before we study the more widely used and sophisticated encryption systems, 


let's look at the two elementary operations, substitution and permutation, which are 
the basis for other systems. 


Substitution is where plaintext symbols are converted into another set of sym- 
bols. A simple example is as follows. 


Plaintext: ABCDEFGHIJKLMNOPQRSTUVWXYZ 
Substitute tex: ZVECLMOFPTXBIKSDQUGWNRYJAH 


Therefore, if we wanted to send the message “THE DOG BITES” we would per- 
form the substitution and send 


"WFL-CSO-VPWLG" 


But we could also transpose or permute the text by some fixed rule where the 
characters are put in fixed block lengths and circularly shifted left or right. For 
example, we could put the characters in groups of four and then permute by per- 
forming circular shift to the right by two: 


Plaintext: X1X5,X4X, 
Permuted text: X4 X4X, X, 
If we extend our previous example, our substituted text becomes, 
Substituted and permuted text: *L-WFO-CSWLVP-G" 


If the message is sufficiently long, the principle flaw of this system is that it can 
be broken by frequency or statistical analysis, where the intruder takes advantage of 
certain letter patterns that occur in the English language. For example, letters e, a, 
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Figure 13.4-5 DES flowchart. 
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expanded to 48 bits using a P-box; (b) The 48-bit result is then exclusively ORed 
with a 48-bit subkey, k; producing a 48-bit word. (c) This 48-bit word is then com- 
pressed to 32 bits using an S-box whose output we call f (R;.,, ki). (d) Function 
f (Ri, kj) is then exclusively ORed with the leftmost 32 bits, L;_;, producing the 
right 32-bit word, R;, for the next round. The new left output is equal to the former 
right input. Mathematically, a round consists of 


L;i = Ri; i= 1,2,...16 [5a] 
R, = Lif (Ropki T. [5b] 


The process to generate the key is shown in Fig. 13.4-7. We first start out with 
a 64-bit key that includes 8 parity bits. A. 64- to 56-bit P-box strips off these parity 
bits leaving us with a 56-bit key. This key is then divided into two 28-bit words. 
Each 28-bit word undergoes one or two left shifts. The two shifted 28-bit words are 
then fed to a 56- to 48-bit P-box whose output is subkey k,. This process is then 
repeated 15 more times to generate an additional 15 subkeys. In the NIST scheme of 
FIPS-46, stages 1, 2, 9, and 16 undergo only one left shift, whereas the other stages 
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undergo two left shifts. To convert the ciphertext back into plaintext, we employ the 
reverse process of Fig. 13.4—5. 

The DES structures and connections of the P- and S-boxes of Figs. 13.4-6 and 
13.4—7 have been standardized and are fully described in paper FIPS-46. Therefore, 
a given key is the sole determining factor of how the data is encrypted. Let's now 
look at the details of a few of the more interesting blocks. The 32- to 48-bit expan- 
sion block used in Fig. 13.4—6 is shown in Fig. 13.4-8. Here, eight 4-bit words are 
expanded to make eight 6-bit words by repeating the edge bits. An expansion block 
causes intruder errors to be magnified, thereby making it more difficult to break the 
code. Fig. 13.4—9 illustrates the 48- to 32-bit compression block used in calculating 
f (Ri, kj). Again, the details of the S-block, S, and the permutation block are given 
in the FIPS-46 paper. 

For increased security, there is also Triple DES, which consists of three stages 
of DES encryption, the first and last using the same key and the middle one using a 
different key. Three stages of encryption enable a triple DES to be compatible with 
older single DES systems whereby a Triple DES can decrypt single DES data. 




















Figure 13.4-8 32- to 48-bit expansion. 
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Rivest-Shamir-Adleman System 


As previously mentioned, the significant challenge of secret-key systems is secure 
but fast and efficient key distribution. Not only must the key channel be secure, but 
in today's e-commerce environment, the key must be readily available. This pre- 
cludes such channels as the conventional mail system for key distribution. To over- 
come this challenge, the Rivest-Shamir-Adleman (RSA) (1978) public key system 
was developed. While DES’s strength is based on the complexity of its algorithm, 
the RSA's security 1s based on the inherent difficulty of determining the prime fac- 
tors of large numbers. The following elementary example illustrates RSA encryp- 


tion in which we want to send an encrypted message consisting of letter "x," whose 
value in ASCII is 88, and then decrypt the received message. 
1. Pick two prime numbers, p and q, and calculate their product or 

n = pq [6] 


Both p and q are kept secret. In this example, use p — 13,q — 17, and then 
n = pq = 221. 
2. Compute the Euler-quotient function 
p(n) = (p — 1)(q ^ 1) = 12 X 16 = 192 (7] 


3. Pick another prime number, e, that is relatively prime to (n) (i.e., the prime 
factors of (n) do not include e). In this case, pick e = 5, since 5 is not a prime 
factor of 192. 


The public keys are then e = 5 and n = 221. 
5. A message is then encrypted using the public keys so that 


y 7 x* mod n [8] 


giving y = 88? mod 221 in this example. As an aside, the mod (modulo) opera- 
tion returns the remainder portion of the quotient of the two numbers or 
y = a mod b means y = rem [a + b]. 

In typical cases calculating x* is too large for most practical computers. 
Instead we factor out the base term in Eq. (6), and thus for this example obtain 


y = 88) mod 221 
= (88? mod 221][88? mod 221](88' mod 221] mod 221 
= [9 x 9 X 88] mod 221 = 56 


Our encrypted message is y — 56. 
6. ‘To decrypt the message, y, we apply the following function 


x = y! mod n [9] 


where d is the private key. 
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7. To determine the private key needed to decrypt the message we use the algorithm: 
de = 1 mod (é(n)] [10a] 
where d is less than ó(n). Equivalently, we can get d using Euclid’s algorithm 
with 
de-é(n)xQ-c1 [105] 
where Q is any integer that satisfies Eq. (10D). 


In this example with ¢(n) = 192 and e = 5, we get 5d = 192Q + 1, and 
if Q = 2 we get d = 77. Thus, one of our private keys is d = 77. 


8. To decrypt y we have 
x = 56" mod 221 
Again, to do this calculation, we have to factor y^ mod n to get 
([(56* mod 221)? mod 221](56 mod 221)) mod 221 = 88 = x 


And we have recovered our original message. 


9. In summary, the public keys are n and e, and the private keys are p, q, and d. 
Encryption and decryption are done using these and Eqs. (8) and (9). 


Determining the private key, d, requires that the intruder determine the prime 
factors of public key, n. For large values of n — pq, calculating p and q from only 
knowing n takes a great deal of computation time. Thus, RSA can be made 
extremely secure. Whenever the state of the art of computation improves, making it ` 
easier for an intruder to factor n and thus break the code, we can simply go to a 
larger value of n. RSA isn't the only public key encryption system, but many believe 
itis the most robust. Similar to RSA, there is also the Diffie-Hellman system (1976) 
and the Merkle-Hellman scheme (1978). 





13.5 PROBLEMS 

13.1-1* Calculate the probabilities that a word has no errors, detected errors, and undetected 
errors when a parity-check code with n = 4 is used and o = 0.1. 

13.1-2 Do Prob. 13.1-1 with n = 9 and a = 0.05. 

13.1-3 Consider the square-array code in Fig. 13.1—1. (a) Confirm that if a word has two 


errors, then they can be detected but not corrected. (b) Discuss what happens when 
a word contains three errors. 
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An FEC system contaminated by gaussian white noise must achieve P,, = 1074 
with minimum transmitted power. Three block codes under consideration have the 
following parameters: 





n k ain 
31 26 3 
31 21 5 
31 16 7 


Determine which code should be used, and calculate the power saving in dB com- 
pared to uncoded transmission. 


Do Prob. 13.1-4 with P,, = 107°. 


Calculate a, P,,, and P,,, at y, = 2, 5, and 10 for an FEC system with gaussian 
white noise using a (31, 26) block code having dnin = 3. Plot your results in a form 
like Fig. 13.1-5. 


Do Prob. 13.1-6 for a (31, 21) code having d, = 5. 


A selective-repeat ARQ system with gaussian white noise is to have P,, = 10? 
using one of the following block codes for error detection: 


n k uin 
12 11 2 
15 11 3 
16 11 4 





Calculate r,/r and y, for each code and for uncoded transmission. Then plot y, in 
dB versus r,/r. 


Do Prob. 13.1-8 for P,, = 107°. 
A go-back-N ARQ system has gaussian white noise, y, = 6 dB, r = 500 kbps, and a 


one-way path length of 45 km. Find P,,, the minimum value of N, and the maximum 
value of r, using a (15, 11) block code with d, = 3 for error detection. 


Do Prob. 13.1-10 using a (16, 11) block code with d, = 4. 


. A stop-and-wait ARQ system uses simple parity checking with n = k + 1 for error 


detection. The system has gaussian white noise, r — 10 kbps, and a one-way path 
length of 18 km. Find the smallest value of k such that Pj, = 107° and r, = 7200 
bps. Then calculate y, in dB. 


Do Prob. 13.1-12 with a 60-km path length. 


Derive m as given in Eq. (19) for a go-back-N ARQ system. Hint: If a given word 
has detected errors in i successive transmissions, then the total number of transmit- 
ted words equals 1 + Ni . 
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Consider a hybrid ARQ system using a code that corrects t errors and detects £ > t 
errors per n-bit word. Obtain an expression for the retransmission probability p 
when a < 1. Then take d,,,, = 4 and compare your result with Eq. (15). 


Suppose a hybrid selective-repeat ARQ system uses an (n, k) block code with dnin = 


2t + 2 to correct up to t errors and detect t + 1 errors per word. (a) Assume a «& 1 
to obtain an approximate expression for the retransmission probability p, and show 


that 
P, = C Qr 
ei t+] 


(b) Evaluate o and p for a (24, 12) code with d,;, = 8 when P,, = 10 ?. Then 


assume gaussian white noise and find P,, for uncoded transmission with the same 
value of y,. 


Let U and V be n-bit vectors. (a) By considering the number of 1’s in U, V, and U + 
V, confirm that d(U, V) = w(U) + w(V). (b) Now let U = X - Y and V — Y  Z. 
Show that U + V = X + Zand derive the triangle inequality 


dX, Z) = d(X, Y) + d(Y, Z) 


Let X be a code vector, let Z be any other vector in the code, and let Y be the vector 
that results when X is received with i bit errors. Use the triangle inequality in Prob. 
13.2-1 to show that if the code has dy, 2 € + 1 and if i £ £, then the errors in Y 
are detectable. 


Let X be a code vector, let Z be any other vector in the code, and let Y be the vector 
that results when X 1s received with i bit errors. Use the triangle inequality in Prob. 
13.2—1 to show that if the code has d, = 2t + 1 and if i S t, then the errors in Y 
are correctable. 


A triple-repetition code is a systematic (3, 1) block code generated using the subma- 
trix P = [1 1]. Tabulate all possible received vectors Y and S = YH”. Then deter- 


mine the corresponding maximum-likelihood errors patterns and corrected vectors 
Y +E. 


Construct the lookup table for the (6, 3) block code in Exercise 13.2-1. 


Consider a (5, 3) block code obtained by deleting the last column of the P submatrix 
in Exercise 13.2-1. Construct the lookup table, and show that this code could be 
used for error detection but not correction. 


Let the P submatrix for a (15, 11) Hamming code be arranged such that the row 
words increase in numerical value from top to bottom. Construct the lookup table 
and write the check-bit equations. 


Suppose a block code with t = 1 is required to have k = 6 message bits per word. 
(a) Find the minimum value of n and the number of bits stored in the lookup table. 
(b) Construct an appropriate P submatrix. 
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Do Prob. 13.2-8 with k — 8. 


It follows from Eq. (4) that XH’ = MA with A = GH". Prove Eq. (9) by showing 
that any element of A has the property a; = 0. 


The original (7, 4) Hamming code is a nonsystematic code with 


Ip Tp 
H-|011001 1 
0001101 


(a) Construct the lookup table and explain the rationale for this nonsystematic form. 
(b) Write the equations for s,, s;, and s, taking Y = X = (x, x4 ^^ x7). Then deter- 
mine which are the message and check bits in X, and obtain the check-bit equations. 


The (7, 4) Hamming code can be extended to form an (8, 4) code by appending a 
fourth check bit chosen such that all codewords have even parity. (a) Apply this 
extension process to Table 13.2-1 to show that d, = 4. (b) Find the corresponding 
equation for c4 in terms of the message bits. (c) Explain how the decoding system in 
Fig. 13.22 should be modified to perform double-error detection as well as single- 
error correction. 


Consider a systematic (7, 3) cyclic code generated by G(p) = p^ + p? -- p? - 0 4 1. 
Find Qy(p), C(p), and X when M = (1 0 1). Then take Y = X’ and confirm that 


S(p) = 0. You may carry out the divisions using binary words rather than polynomi- 
als if you wish. 


Do Prob. 13.2-13 with G(p) = pt - Oc p? t pc kl 


Given G(p), the equivalent generator matrix of a systematic cyclic code can be 
found from the polynomials R(p) = rem [p^ "/G(p)], 1 x i € k, which corre- 
spond to the rows of the P submatrix. Use this method to obtain P for a (7, 4) Ham- 
ming code generated by G(p) = p? + 0 + p + 1. (You may carry out the divisions 
using binary words rather than polynomials if you wish.) Compare your result with 
the G matrix in Example 13.2-1. 


Do Prob. 13.2-15 with G(p) = p? + p? + 0 4 1. 


Figure P13.2-17 is a shift-register circuit that divides an arbitrary mth-order polyno- 
mial Z( p) by a fixed polynomial G(p) = p* + g, ,p* ! + = + gp + 1. If the 


. register has been cleared before Z(p) is shifted in, then the output equals the quo- 


tient and the remainder appears in the register after m shift cycles. Confirm the divi- 
sion operation by constructing a table similar to Fig. 13.2-4b, taking 
Z(p) = p*M(p) and G(p) as in Example 13.2-3. 
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ro r 


Figure P13.2-17 





Use the shift-register polynomial divider in Prob. 13.2—17 to diagram a circuit that 
produces the syndrome S(p) from Y(p) for the code in Example 13.2~3. Then con- 
struct a table like Fig. 13.2-4b to show that S(p) = 0 when Y = (1100010). 


Suppose an uncoded system has a P,e = 1075. What is P, and a for the following 
Hamming codes: (a) (7, 4), (b) (15, 11), (c) (31, 26)? 


For a fixed value of y, show that a (7, 4) code has a lower P,, than a (31, 26) code. 
Why is this the case, and what is the advantage of the (31, 26) code? 


Suppose an uncoded system has a P,e = 1075. What would be the overall P,, if the 
data were sent three times but at three times the data rate with the receiver deciding 
the bit value based on a majority vote? Compare your results to an equivalent system 
that uses (7, 4) and (15, 11) FEC coding. 


Determine M given tbe following Y vectors that were generated via a (7, 4) Ham- 
ming code. Each one may have a single error. (a) Y = [0100101], (6) Y= 
[0111111], (c) Y = [1010111], (d) Y = [1101000]. 

Determine X for ASCII letter "E" and CRC-12 polynomial. 


For the transmitted codeword in Prob. 13.2-23, show that the following errors will 
be detected: errors in the last two digits. 


A channel that transmits 7-bit ASCII information at 9600 bits per second often has 
noise bursts lasting 125 ms, but with at least 125 ms between bursts. Using a (63, 
45) BCH code, design an interleaving system to eliminate the effects of these error 
bursts. What is the maximum delay time between interleaving and deinterleaving? 


Diagram the encoder for a systematic (3, 2, 3) convolutional code. Label the input 
and output rates and the current input and state at an arbitrary time. 


Diagram the encoder for a systematic (4, 3, 1) convolutional code. Label the input 
and output rates and the current input and state at an arbitrary time. 


A (3, 1, 2) encoder achieves maximum free distance when 


d. Fori an Hl. , 
x; = mam x} = x= mj; €x 
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(a) Construct the code trellis and state diagram. Then find the state and output 
sequence produced by the input sequence 1011001111. (b) Construct the modified 
state diagram like Fig. 13.1-8a, identify the minimum-weight path or paths, and 
determine the values of d; and M(d;). 


Do Prob. 13.3-3 for a (2, 1, 3) encoder with 
xj = m, 4€ m, ,QOm; xj = mi.39xi 


which achieves maximum free distance. Use the pattern in Fig. P13.3-4 for your 
state diagram. 





Figure P13.3-4 


Determine the output from the encoder of Fig. P13.3—5 for message input of 
M = (1101011101110000 ... ). 


Output 





Figure P13.3-5 


Determine the output from the (3, 2, 1) encoder of Fig. 13.3-5 for message input of 
M = (1101011101110000 ... ). 
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Determine the output from the (3, 2, 1) encoder of Fig. P13.3-7 for message input of 
M = (1101011101110000 ... ). 





Input X x Output 





Figure P13.3-7 


Determine if the codes produced by the encoders of Figs. 13.3-2, P13.3-5, P13.3-7, 
and P13.3-8 are catastrophic. 


Output 
Te 





Figure P13.3-8 


Use block-diagram reduction of Fig. 13.3-8b to get T(D, I) as in Eq. (7a). 
Derive Eq. (10) from Eqs. (8) and (9). 


Consider a (2, 1, 1) code with x; = m; and xj = m;.., mj. (a) Construct the code 
trellis and modified state diagram. Then identify the minimum-weight path or paths 
and find the values of d, and M(dj). (b) Use block-diagram reduction to obtain 
T(D, I) in the form of Eq. (7a). (c) Find and compare P,, from Eqs. (9) and (10), 
assuming a < 1. 


Do Prob. 133-11 for a (3, 1, 2) code with x= m,xj = m, .,€ mj, and 
dms m;., mj- m; 


Use the Viterbi algorithm and the code trellis in Fig. 13.3-4 to find the sequences 
Y + E and M when Y = 01 11 01 01 11 01 10 11. If two paths arriving at a given 
node have equal running metrics, arbitrarily keep the upper path. 
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Construct the code trellis for a (2, 1, 2) code with x; = mj- m; and xj 

Mj-2 Qm; _,. Then apply the Viterbi algorithm to find the mias Y + E and M 
when Y = 10 11 01 01 10 01 10 11. If two paths arriving at a given node have equal 
running metrics, arbitrarily keep the upper path. 


A systematic (2, 1, 4) code intended for feedback decoding with majority logic has 
x; = mj.,C€ mj.4€ mj. Construct a parity-check table like Fig. 13.3-14 showing all 
the error bits that appear in the check sums 5;.., through s;. Then find an orthogonal 
set of three check sums, and draw a complete diagram of the decoder. 


Do Prob. 13.3-15 with x; = m; ., m; , Qm, 
Show the block diagrams for the following RSC encoders: (a) (35, 21), (b) (34, 23). 


Encrypt and decrypt sequence x = 8, 27, 51 using the RSA algorithm with: (a) p = 5, 
q^71lLe-7(b)p-7^1llLq-2^37e-2T7;()p-13,4-237,d-—173. 


List all the private/public keys for an RSA system with p = 11 and q = 31. 
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CHAPTER 14 © Bandpass Digital Transmission 


L digital transmission usually requires CVV modulation to generate a bandpass signal suited to the trans- 
mission medium—be it radio, cable, telephone lines [for personal computer Internet connection], or whatever. Just 
as there are a multitude of modulation methods for analog signals, there are many ways of impressing digital infor- 
mation upon a carrier wave. This chapter applies concepts of baseband digital transmission and CW modulation to 
the study of bandpass digital transmission. 

We begin with waveforms and spectral analysis of digital CW modulation for binary and M-ary modulating sig- 
nals. Then we focus on the demodulation of binary signals in noise to bring out the distinction between coherent [syn- 
chronous) detection and noncoherent lenvelope} detection. Next we deal with quadrature-carrier M-ary systems, 
leading to a comparison of modulation methods with regard to spectral efficiency, hardware complexity, and system 
performance in the face of corrupting noise. We then conclude with a discussion of trelliscoded modulation [TCM], 
where we combine convolutional coding and M-ary digital modulation. 


OBJECTIVES 





After studying this chapter and working the exercises, you should be able to do each of the following: 


l. Identify the format of binary ASK, FSK, PSK, and DSB with baseband pulse-shaping waveforms (Sect. 14.1). 
2. State the distinctions between the various ASK, PSK, and FSK methods (Sect. 14.1). 


3. Calculate the error probabilities of binary and M-ary modulation systems. Or, for a specified error probability and 
noise level, specify the transmitter or receiver power (Sects. 14.2, 14.3, and 14.4). 


Me Ov e 


Describe the operation of the correlation receiver (Sect. 14.2), 

Predict error probabilities given conditional PDFs (Sect. 14.2). 

Specify the appropriate detector(s) for each of the binary and M-ary modulation systems (Sects. 14.2, 14.3, and 14.4). 
Explain trellis-coded modulation and predict the corresponding coding gains over conventional digital modula- 


tion methods (Sect. 14.5). 








14.1 DIGITAL CW MODULATION 


A digital signal can modulate the amplitude, frequency, or phase of a sinusoidal car- 
rier wave. If the modulating waveform consists of NRZ rectangular pulses, then the 
modulated parameter will be switched or keyed from one discrete value to another. 
Figure 14.1—1 illustrates binary amplitude-shift keying (ASK), frequency-shift 
keying (FSK), and phase-shift keying (PSK). Also shown, for contrast, is the wave- 
form produced by DSB modulation with Nyquist pulse shaping at baseband. Other 
modulation techniques combine amplitude and phase modulation, with or without 
baseband pulse shaping. 

In this section we’ll define specific types of digital modulation in terms of 
mathematical models and/or transmitter diagrams. We’ll also examine their power 
spectra and estimate therefrom the transmission bandwidth required for a given dig- 
ital signaling rate. As preparation, we first develop a technique for spectral analysis 
of bandpass digital signals. 
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Figure 14.1-1 Binary modulated waveforms. [a) ASK; (b) FSK; (c) PSK; (d) DSB with baseband pulse shaping. 


Spectral Analysis of Bandpass Digital Signals 
Any modulated bandpass signal may be expressed in the quadrature-carrier form 
x(t) = Adx/(t) cos (wt + 0) — x,(t) sin (ot + 8)] [1] 


The carrier frequency f, amplitude A,, and phase 0 are constant, while the time- 
varying i (in-phase) and q (quadrature) components contain the message. Spectral 
analysis of x,(t) is relatively easy when the i and q components are statistically inde- 
pendent signals and at least one has zero mean. Then, from the superposition and 
modulation relations in Sect. 9.2, the power spectrum of x,(t) becomes 


CLF) = SIGUE) + GLE +f) + GE - 4) + GEH 0] 


where G( f) and G,(f ) are the power spectra of the i and q components. For a more 
compact expression, we define the eye lowpass spectrum 


Gef) = = G(f) + Gf) [2] 
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So that 


2 


Gf) = F lC - fe) + Ga + 2] (31 


Thus, the bandpass spectrum is obtained from the equivalent lowpass spectrum by 
simple frequency translation. 


Now suppose that the i component is an M-ary digital signal, say 
x(t) = > a,p(t — kD) [4a] 
k 


where a, represents a sequence of source digits with rate r — 1/D. We assume 
throughout that source digits are equiprobable, statistically independent, and uncor- 
related. Consequently, Eq. (11), Sect. 11.1, applies here and 


oo 
G(f) = ezr|PCA)P + (mr. >) |Plnr)? df — nr) (45) 
n--oo 
Similar expressions hold when the q component is another digital waveform. 

The pulse shape p(t) in Eq. (4a) depends on the baseband filtering, if any, and 
on the type of modulation. Keyed modulation involves NRZ rectangular pulses, and 
we'll find it convenient to work with pulses that start at t = KD, rather than being 
centered at ? — kD as in Chap. 11. Accordingly, let 


1 o<t<D 
0 otherwise 


Pot) 2 u(t) — u(t - D) = { 


[5a] 


whose Fourier transform yields 
l 2 
[PY f)? = D? sinc? fD = 5 sinc? Z [5b] 
E 


If p(t) = pp(t) in Eq. (4a), then the continuous spectral term in Eq. (4b) will be 
proportional to |Pp(f )|*. Since sinc? ( f / r) is not bandlimited, we conclude from 
Eqs. (2) and (3) that keyed modulation requires f. >> r in order to produce a band- 
pass signal. 


Amplitude Modulation Methods 


The binary ASK waveform illustrated in Fig. 14.1-1a could be generated simply by 
turning tbe carrier on and off, a process described as on-off keying (OOK). In gen- 
eral, an M-ary ASK waveform has M — 1 discrete "on" amplitudes as well as the 
"off" state. Since there are no phase reversals or other variations, we can set the q 
component of x(t) equal to zero and take the i component to be a unipolar NRZ sig- 
nal, namely 


x(t) = M a,py(t — kD) dy OL lo rM RT lóa] 
k 
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The mean and variance of the digital sequence are 





lm. M-i guy. dag] 
My = a = > Ca yg irm [6b] 


Hence, the equivalent lowpass spectrum is 





es — 1 

— snf a MOM ge [7] 
obtained with the help of Eqs. (2), (4b), and (5b). 

Figure 14.1—2 shows the resulting bandpass spectrum G.( f ) for f > 0. Most of 
the signal power is contained within the range f. + r/2, and the spectrum has a 
second-order rolloff proportional to | f — f.| ? away from the carrier frequency. 
These considerations suggest the estimated transmission bandwidth to be B; = r. 
If an M-ary ASK signal represents binary data at rate r, = rlog, M, then By = 
r,/log, M or 





Gef) = G(f) = 


r,/Br = log; M bps/Hz [8] 


This ratio of bit rate to transmission bandwidth serves as our measure of modulation 
"speed" or spectral efficiency. Binary OOK has the poorest spectral efficiency since 
r,/Br = 1 bps/Hz when M = 2. 

Drawing upon the principle of quadrature-carrier multiplexing, quadrature- 
carrier AM (QAM) achieves twice the modulation speed of binary ASK. Figure 
14.1~3a depicts the functional blocks of a binary QAM transmitter with a polar 
binary input at rate r,. The serial-to-parallel converter divides the input into two 
streams consisting of alternate bits at rate r = r,/2. Thus, the i and q modulating 
signals are represented by 


x(t) = 2,22 Pp(t — kD) xét) = 2; an Pol! = kD) 


GAF) 








0 Ák-r fs JEET 


Figure 14.1-2 ASK power spectrum. 
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Figure 14.1-3 Binary QAM. (a) Transmitter; (b) signal constellation. 


where D = 1/r = 2T, anda, = +1. The peak modulating values are x; = x, = +1 
during an arbitrary interval AD < t < (k + 1)D. Figure 14.1—3b conveys this 
information as a two-dimensional signal constellation. The four signal points have 
been labeled with the corresponding pairs of source bits, known as dibits. 

Summing the modulated carriers finally yields the QAM signal in the form of 
Eq. (1). The i and q components are independent but they have the same pulse shape 
and the same statistical values, namely, m, = 0 and o2 = 1. Thus, 


Galf) = 2X r|Po(f)P =~ sine? 22 ip 
lp L^ 
where we've used Eqs. (4b) and (5b) with r = r,/2. Binary QAM achieves 
r,/Br ~= 2 bps/Hz because the dibit rate equals one-half of the input bit rate, reduc- 
ing the transmission bandwidth to B; = r,/2. 

Keep in mind, however, that ASK and QAM spectra actually extend beyond the 
estimated transmission bandwidth. Such spectral “spillover” outside By becomes an 
important concern in radio transmission and frequency-division multiplexing sys- 
tems when it creates interference with other signal channels. Bandpass filtering at 
the output of the modulator controls spillover, but heavy filtering introduces ISI in 
the modulated signal and should be avoided. 

Spectral efficiency without spillover is achieved by the vestigial-sideband mod- 
ulator diagrammed in Fig. 14.1—4a. This VSB method applies Nyquist pulse shaping 
to a polar input signal, as covered in Sect. 11.3, producing a bandlimited modulating 
signal with B = (r/2) + By. The VSB filter then removes all but a vestige of width 
By from one sideband, so G,(f) looks something like Fig. 14.4b—a bandlimited 
spectrum with B; = (r/2) + By + By. Therefore, if r = r,/log, M, then 


r,/Br = 21og; M (10] 


and the upper bound holds when By << rand By < r. 
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Figure 14.1-4 Digital VSB. [a] Transmitter; (b) power spectrum. 





EXERCISE 14.1-1 


Binary data is to be transmitted on a 1-MHz carrier. Spillover is not a concern, but B; 
must satisfy the fractional bandwidth constraint B;/f. = 0.1. Estimate the maximum 


possible bit rate r, when the modulation is: (a) OOK, (b) binary QAM, (c) VSB with 
M — 8. 





Phase Modulation Methods 


The binary PSK waveform back in Fig. 14.1-1c contains phase shifts of +7 radi- 
ans, often described as binary phase-shift keying (BPSK) or phase-reversal key- 
ing (PRK). An M-ary PSK signal has phase shift ¢, in the time interval 
kD < t < (k + 1)D, expressed in general by . 


x(t) =A, X, cos (w,t + 0 + o )py(t — kD) [11] 
k 


Trigonometric expansion of the cosine function yields our desired quadrature- 
carrier form with 


x(t) = Eh -kD) x()- LOspolt— kD) Nad 
where 


I, = coso, Q, = sind, [12b] 

To ensure the largest possible phase modulation for a given value of M, we'll take 
the relationship between ó, and a, to be 

b, v (2a, + N)/M a8,—70,1,...,M—I [13] 


in which N is an integer, usually O or 1. 
Examples of PSK signal constellations are shown in Fig. 14.1—5, including the 
corresponding binary words in Gray code. The binary words for adjacent signal 
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(01) 


QD (00) , 





(10) 


fa) 





Figure 14.1-5 PSK signal constellations. {a} M = 4; (b) M = 8. 


points therefore differ by just one bit. The PSK signal with M = 4 and N = O repre- 
sented in Fig. 14.1—5a is designated quaternary or quadriphase PSK (QPSK). 
Had we taken QPSK with N = 1, the signal points would have been identical to 
binary QAM (Fig. 14.1-3)). Indeed, you can think of binary QAM as two BPSK 
signals on quadrature carriers. M-ary PSK differs from M-ary ASK, of course, since 
an ideal PSK waveform always has a constant envelope. 


PSK spectral analysis becomes a routine task after you note from Eqs. (12b) 
and (13) that 


Hence, the i and q components are statistically independent, and 
r 1 
Gulf) = 2 x TIPP) = + sin? na 


Comparison with Eq. (7) reveals that Gif) will have the same shape as an ASK 
spectrum (Fig. 14.1—2) without the carrier-frequency impulse. The absence of a dis- 
crete carrier component means that PSK has better power efficiency, but the spectral 
efficiency is the same as ASK. 

Some PSK transmitters include a BPF to control spillover. However, bandpass 
filtering produces envelope variations via the FM-to-AM conversion effect dis- 
cussed in Sect. 5.2. (Remember that a stepwise phase shift is equivalent to an FM 
impulse.) The typical nonlinear amplifier used at microwave carrier frequencies will 
flatten out these envelope variations and restore spillover—largely negating the 
function of the BPF. A special form of QPSK called staggered or offset-keyed 
QPSK (OQPSK) has been devised to combat this problem. The OQPSK transmitter 
diagrammed in Fig. 14.1—6 delays the quadrature signal such that modulated phase 
shifts occur every D/2 = T, seconds but they never exceed 77/2 radians. Cutting 


the maximum phase shift in half results in much smaller envelope variations after 
bandpass filtering. 
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BPF 


Figure 14.1-6 Offset-keyed QPSK transmitter. 









x(t) Data 


converter 





When envelope variations are allowable, combined amplitude-phase keying 
(APK) is an attractive family of modulation methods. APK has essentially the same 
spectral efficiency as PSK, but it can achieve better performance with respect to 
noise and errors. Further discussion is postponed to Sect. 14.4. 





= 0, 1. EXERCISE 14.1-2 


Draw the signal constellation for binary PSK with $, — z(2a, — 1)/4 and a, 
Then determine the lowpass equivalent spectrum and sketch G,(f). 





| Frequency Modulation Methods 


There are two basic methods for digital frequency modulation. Frequency-shift 
keying (FSK) is represented conceptually by Fig. 14.1—7a, where the digital signal 
| x(t) controls a switch that selects the modulated frequency from a bank of M oscilla- 
tors. The modulated signal is discontinuous at every switching instant t = KD. 
Unless the amplitude, frequency, and phase of each oscillator has been carefully 
| adjusted, the resultant output spectrum will contain relatively large sidelobes which 
don't carry any additional information and thus waste bandwidth. Discontinuities 
are avoided in continuous-phase FSK (CPFSK) represented in Fig. 14.1—7b, where 
| x(t) modulates the frequency of a single oscillator. Both forms of digital frequency 
modulation pose significant difficulty for spectral analysis, so we'll limit our con- 
i sideration to some selected cases. 
| First, consider M-ary FSK. Let all oscillators in Fig. 14.1-7a have the same 
amplitude A, and phase 0, and let their frequencies be related to a, by 


fff 4; 7 El; EJ EM = 1) [15a] 


which assumes that M is even, Then 


x5 = A, >, cos (wt + 0 + c,a,t)pp(t — kD) [155] 
k 
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x(t) x(t) x(t) Frequency 
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Figure 14.1-7 Digital frequency modulation. [a] FSK; (b] continuous phase FSK. 


where w4 = 27 fa. The parameter f; equals the frequency shift away from f. when 
a, = +1, and adjacent frequencies are spaced by 2 f}. Continuity of x(t) at t = kD 
is assured if 204 D = 27rN where N is an integer. 


We'll analyze a version of binary FSK known as Sunde's FSK, defined by the 
foregoing relations with M = 2, D = T, = 1/r,, and N = 1. Then pp(t) = u(t) — 
u(t = KT,) and 

fa => r,/2 [16] 
After trigonometric expansion of x,(t), we use the fact that a, = +1 to write 
COS Wya,t = COS (4t Sin wW 7a,t = a, sin wt 
The i component thereby reduces to 
x(t) = cos mrt [17a] 


independent of a,. The q component contains a, in the form 


x(t) = >) a, sin (ery)|u(t — KT) — u(t — kT, — 7,)) [175] 
= » Q,p(t — kT) k = lad an 
where 
p(t) = sin (mr,t)[u(t) — u(t — T;)] [17d 


The intervening manipulations are left to you as an instructive exercise. 

Once again, we have independent i and q components. The i component, being 
a sinusoid, just contributes spectral impulses at *7,/2 in the equivalent lowpass 
spectrum. The power spectrum of the q component contains no impulses since 
Q, 0, whereas Q2 — a2 — 1. Thus, 


Galf) = js = z) + (r- 2) + nJPCF)? [18a] 


where 
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POP = qa sine) au 0B 





7 + sinc ————— [185] 
4r, r, lp 
_ 4 = (mf/r;) | 
zr LQfiny - 1 


The resulting bandpass spectrum is shown in Fig. 14.1-8. 

Observe that the impulses correspond to the keyed frequencies f. + fa = 
f. = r,/2, and that the spectrum has a fourth-order rolloff. This rapid rolloff means 
that Sunde's FSK has very little spillover for | f — f,| > r, We therefore take 
By; 7 ry, even though the central lobe of G,{ f ) is 5096 wider than the central lobe of 
a binary ASK or PSK spectrum. 

Another special case is M-ary orthogonal FSK, in which the M keyed frequen- 
cies are equispaced by 2 f, = 1/2D = r/2. Without attempting the spectral analysis, 
we can surmise that By = M X 2 f4 = Mr/2 = Mr,/(2 log, M). Therefore, 


r,/Br = (210g;M)/M [19] 


and the modulation speed is less than M-ary ASK or PSK for M = 4. In other 
words, orthogonal FSK is a wideband modulation method. 


CPFSK may be wideband or narrowband depending on the frequency devia- 
tion. Let x(£) in Fig. 14.1—7b start at t = 0, so 


x(t) = WMa,ppyt— kD) =a, = £1, +2,..., (M — 1) 
k=0 
and frequency modulation produces the CPFSK signal 


t 
x(t) = A, cos [os +0 + oy [ x0) a t=0 
0 


G&f) 





1 i 
3 
fet 3o 


0 7 
fR-ín | fe | 2 
fe -h2 ftn 


Figure 14.1-8 Power spectrum of binary FSK with fy = r,/2. 
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To bring out the difference between CPFSK and FSK, consider the integral 
t co i 
EG dÀ = Sal Pp(à — kD) dà 
0 k=0 vo 


in which pp(A — kD) = O except for kD < A < (k + 1)D when pp(A — kD) =1. 
Piecewise integration yields 


| 


| 20) aa = a O<t<D 
0 


|! 


aD + a,(t — D) D<t<2D 


( a )D + «t = D) kD<t<(k+1)D 


j=0 


Now we can express x,(t) in the summation form 


Mes 


x(t) =A, >, cos [wt - 0 + 6, t eat — kD)]pp(t - kD) — [20a] 


k 


y 
o 


where t = 0 and 


k-i 
Px £ wD ` aj [20b] 
j-0 


with the understanding that @, = Ofork = 0. 

Equation (20) shows that CFPSK has a frequency shift fya, in the interval 
kD < t < (k + 1)D, just like FSK. But it also has a phase shift $, that depends on 
the previous digits. This phase shift results from the frequency-modulation process 
and ensures phase continuity for all t. Unfortunately, the past history embodied in ¢, 
greatly complicates CPFSK spectral analysis. Proakis (2001, Chap. 4) gives further 
details and plots of G,(f } for various values of f; when M = 2, 4, and 8. To con- 
clude this section, we'l] examine an important special case of binary CPFSK called 
minimum-shift keying (MSK). 





Carry out the details omitted in the derivation of Eqs. (17a)-(17c). Hint: Show that 
sin wat = sin [walt — kT,) + km] = cos (kr) X sin [e(t — kT,)]. 


EXERCISE 14.1-3 





Minimum-Shift Keying* 
Minimum-shift keying, also known as fast FSK, is binary CPFSK with 


Fe 


gie 
fa a= Xl $,7— Ma [21] 
4 2 1-9 
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Notice that the frequency spacing 2 f; = 


r,/2 is half that of Sunde's FSK. This fact, 
together with the continuous-phase property, results in a more compact spectrum, 
free of impulses. Subsequent analysis will prove that G(f ) = G,(f ) and 


Gs) d sine OY) ua ELT 





(n/2) E (n2) 
_ 16 E: (27f/rp) i 
wr, (4f/r,)° =) 


The bandpass spectrum G,(f ) plotted in Fig. 14.1—9 has minuscule spillover beyond 
the central lobe of width 3r,/2. The rapid rolloff justifies taking By = r,/2, so 


r,/Br = 2 bps/Hz 


which is twice the modulation speed of Sunde’s FSK and accounts for the name 
"fast" FSK. 


Our investigation of MSK starts with the usual trigonometric expansion to put 
x(t) in quadrature-carrier form with 


(22] 





x(t) = 2; (Pk + avc)pr (t — KT) 
x(t) = by sin (by + a,c,)pr(t — kT;) 


> 
Hl 


0 
where 


Wy 

cet Kn) aft) = ult) - u(t Rn) 
We'll also draw upon the behavior of $, versus k as displayed in the trellis pattern of 
Fig. 14.1—10. This pattern clearly reveals that 6, = 0, +7, £27, ..., for even val- 
ues of k while 6, = =77/2, +37/2, ..., for odd values of k. 
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Figure 14.1-9 MSK power spectrum. 
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Figure 14.1-10 MSK phase trellis. 
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As a specific example, let the input message sequence be 100010111. The 
resulting phase path d$, is shown in Fig. 14.1—11a, taking a, = +1 for input bit 1 
and a, = —1 for input bit 0. The corresponding i and q waveforms calculated from 
the foregoing expressions are sketched in Fig. 14.1-11b. We see that both wave- 
forms have zeros spaced by 2T,, but staggered such that the zeros of x,(t) coincide 
with the peaks of x,(t), and vice versa. These observations will guide our subse- 
quent work. 

Consider an arbitrary time interval between adjacent zeros of the i component, i.e., 





with & being even. During this interval, 
x(t) = cos (64; + @y—1Cx-1) Prt — (k — 1)T,) 
+ cos (6, + a,c,)pr(t — KT;) 


which we seek to combine into a single term. Since k is even, sin œ} = 0, and rou- 
tine trigonometric manipulations yield 


cos (py + a,c,) = cos 6, cos (a,c,) = cos $,cos c, 
Likewise, using 
cos $,., = 0 Par = 6, — a, 7/2 Ck-1 = Ck t T/2 
we get 
cos (6... + ap-1Ck-1) = —sin p- sin (ay.,6,.1) 


2 
= ag- COS dy COS C, = COS Hy COS C, 
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Message 




















(b) 


Figure 14.1-11 Illustration of MSK. (a) Phase path; (b) i and q waveforms. 


Thus, for the interval in question, 
x(t) = cos $, cos c, (pr [t — (k — 1)7] + Prt — k7;)] 


cos $, cos [(mr,/2)(t — kKT,)Mu(t — kT, + T,) — u(t — kT, — T;)] 


Summing intervals to encompass all t = 0 finally yields 


{I 


x(t) = > I, p(t s kT,) I, — COS Py [23] 
k even 
where 
P(t) = cos (wr, t/2)[u(t + T,) — u(t — T;)] [24] 


This result checks out against the waveform in Fig. 14.1-110 since 7, = cos $, 


= —] when kis even. 


eee 
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Now, for the g component, we consider the interval (k — 1)7, «t < (k + 1)T, 
with k odd. Similar manipulations as before lead to 
x(t) = sin $, cos cl prt — (k — 1)T,] + prt — kT,)} 
Thus, for all t = 0, 


= > Q,p(t — kT,) Q, = sind, (25] 


k odd 


which also agrees with Fig. 14.1-11b. Equation (22) follows from Eqs. (23)-(25) 
since the i and q components are independent, with I, = Q, = 0 and i= =0 = — ]. 





14.2 COHERENT BINARY SYSTEMS 


Coherent bandpass digital systems employ information about the carrier frequency 
and phase at the receiver to detect the message—like synchronous analog detection. 
Noncoherent systems don't require synchronization with the carrier phase, but they 
fall short of the optimum performance made possible by coherent detection. 

This section examines coherent binary transmission, starting with a general 
treatment of optimum binary detection in the presence of additive white gaussian 
noise (AWGN). The results are then applied to assess the performance of specific 
binary modulation systems. We'll focus throughout on keyed modulation (OOK, 
PRK, and FSK), without baseband filtering or transmission distortion that might 
produce ISI in the modulated signal. 


Optimum Binary Detection 


Any bandpass binary signal with keyed modulation can be expressed in the general 
quadrature-carrier form 


TORRID p(t- en)| cos (o,t +0) — È Qupét — e) | sin (c, t + a} 


For practical coherent systems, the carrier wave should be synchronized with the 
digital modulation. Accordingly, we’ll take 9 = 0 and impose the condition 


LN Naty n 
where N, is an integer—usually a very large integer. Then 


x«t) "e A. >» [Mpt g kT,) cos w(t = kT,) e Qp B kT,) sin at T kT,)] 
k 


and we can concentrate on a single bit interval by writing 
x,(t) = s,,{t — kT,) kT, <t «€ (k + IT, [2] 
with 
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Sm(t) 2 AL pt) COS ci — Q: Palt) sin wt] 


Here, s,,(t) stands for either of two signaling waveforms, s)(t) and s,(t), represent- 
ing the message bits m = 0 and m = 1. 

Now consider the received signal x,(t) corrupted by white gaussian noise. We 
showed in Sect. 11.2 that an optimum baseband receiver minimizes error probability 
with the help of a filter matched to the baseband pulse shape. But binary CW modu- 
lation involves two different signaling waveforms, as in Eq. (2), rather than one 
pulse shape with two different amplitudes. Consequently, we must redo our previous 
analysis in terms of s9(t) and s(t). 

Figure 14.2-1 shows the proposed receiver structure labeled with the relevant 
signals and noise for the interval under consideration. This bandpass receiver is just 
like a baseband receiver with a BPF in place of an LPF. The filtered signal plus noise 
y(t) is sampled at t, = (k + 1)7,, the end of the bit interval, and compared with a 
threshold level to regenerate the most likely message bit m. We seek the BPF 
impulse response A(t) and threshold level V for optimum binary detection, resulting 
in the smallest average regeneration error probability. 

As in Sect. 11.2 let H, and Hp denote the hypotheses that m = 1 and m = Q, 
respectively. The receiver decides between H, and Hj according to the observed 
value of the random variable 


Y = y(t) = Zm m 


where 


I> 


Zm = zw) = [Sm(t — kT,)*h(t)] [3] 


tt, 


(k+ 1)T; 
| Sm(A — kT,)h(t, — A) da 


kT, 


| SORS — A) dA 


The noise sample n = n(t,) is a gaussian RV with zero mean and variance o°, so the 
conditional PDFs of Y given H, or Hy will be gaussian curves centered at z, Or zo, 











Ny2 
X(t) = S(t — kT,) BPE x) xp 
So + -— AC S/H Pm 5 
get V 








Figure 14.2-1 Bandpass binary receiver. 
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portrayed by Fig. 14.2-2. With the usual assumption of equally likely zeros and 
ones, the optimum threshold is at the intersection point, i.e., 


1 
Vor = 5 + Zo) 


Then, from the symmetry of the PDFs, P, = P, and 
P, = Q(lz, — zoļ/20) 


in which the absolute-value notation |z} — Z| includes the case of z} < Zp. 
But what BPF impulse response A(t) maximizes the ratio |z} — zg|/2o or, 
equivalently, |z} — zo|//4a?? To answer this question, we note from Eq. (3) that 
co 2 
zı- Zol? E | (5A) — so(A) A(T, — A) dA [4a] 


— O00 





where the infinite limits are allowed since s,,(t) = 0 outside of O < t < T;. We also 


note that 
2 No [7 2 No [7 2 
Qu. |A(t) |? dt = z \A(T, — A)P da [4b] 
Application of Schwartz’s inequality now yields 
li Rol” 1 | E 
mus x ccc t) — t)? dt 5 
4g? 2. Ng M ) Sol )] [ ] 
and the ratio is maximum if A(T, — t) = K(s,(t) — so(t) |. Thus, 
ht) = K{s,(T, — t) — So( T5 ax t) | [6] 


with K being an arbitrary constant. 
Equation (6) says that: 





Py (Ap) PyOlHy) 


Figure 14.22 Conditional PDFs. 
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Alternatively, you could use two matched filters with A,(t) = Ks,(T, — t) and 
ho(t) = Kso(T, — t) arranged in parallel per Fig. 14.2—3a; subtracting the output of 
the lower branch from the upper branch yields the same optimum response. In either 
case, any stored energy in the filters must be discharged after each sampling instant 
to prevent ISI in subsequent bit intervals. 

Another alternative, with built-in discharge, is based on the observation that the 
sampled signal value from the upper branch in Fig. 14.2-3a is 


Zmilte) = [ «00. — A) da 
0 


(k+ WT, 
| Sm(t — KTy)Ks,(t — KT) dt 
kT, 


and likewise for Z,,.9(t,). Hence, optimum filtering can be implemented by the sys- 
tem diagrammed in Fig. 14.2-3b, which requires two multipliers, two integrators, 
and stored copies of so(t) and s(t). 

This system is called a correlation detector because it correlates the received 
signal plus noise with noise-free copies of the signaling waveforms. Note that corre- 
lation detection is a generalization of the integrate-and-dump technique for matched 
filtering. It should also be noted that the matched filter and correlation detector are 
equivalent only at the sample time f,. 

Regardless of the particular implementation method, the error probability with 
optimum binary detection depends upon the ratio maximized in Eq. (5). This ratio, 






Ay(t)= | mÒ 


Ks (T, - 
si( b 2) T Zm = 2m (ty) g Zmolty) 
È 
| o= | Hi: a 
KsT; — t) i 
peT] 1 


Zmol t) 


SbF) 





sync 


(a) 





j (k+ DT, 
m tal 


——————— Zm 
pp 

eee 2 } 

kT, 


(b) 







St — KTh) 








| 


Ksy(t — kT, ) 


Figure 14.2-3 Optimum binary detection. (a) Parallel matched filters; (b) correlation detector. 
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in turn, depends on the signal energy per bit and on the similarity of the signaling 
waveforms. To pursue this point, consider the expansion 


T, 
| [s (t) — s (dt = E, + Ey ~ 25y 
0 
where 


E = so(t) dt 


I> 
oo 
a 
ta 
pu 
™ 
~~ 
Aa 
~ 
ty 
eo 
| 
———ás 
- 
N 


Ty 
^ 
Ej — | s,(£)so(£) dt 
0 


We identify E, and Ep as the respective energies of s(t) and so(t), while Ey, is 
proportional to their correlation coefficient. We define this correlation coefficient as 
T, 
p Ê =| «o t)so(t) dt [8] 
0 
V E,E, 
Since zeros and ones are equally likely, the average signal energy per bit is 


1 
E, = 3 + Ep) 








Therefore, 
Beg _ E By € 2B. Eg Ep 
SS [9a] 
2c max 2No No 
and l 
= Q[ V(E, — Eo)/Mo ] [9b] 
or, if equal signal energies, 
S QU V E a p)/No | [9d 


Equation (9) brings out the importance of Ej, relative to system performance when 
E, and Np are fixed and how system performance depends on the correlation coeffi- 
cient of the two signals. 


Finally, substituting Eq. (6) into Eq. (3) yields z} = K(E, — Ejo) and z = 
K(Eo = Ep), SO 


1 
Vopt = 5 + Zo) TES 


Note that the optimum threshold does not involve E. 
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Derive Eqs. (5) and (6) from Eqs. (4a) and (4b). Use Eq. 17, Sect. 3.6, written in the EXERCISE 14.2-1 
form 


In V(A)W#(A) da 


—co 





e = | [VAP da 
| [WOO P dA =o 


—Oo0 


and recall that the equality holds when V(A) and W(A) are proportional functions. 





Coherent OOK, BPSK, and FSK 


Although the crude nature of ASK hardly warrants sophisticated system design, a 
brief look at coherent on-off keying helps clarify optimum detection concepts. The 
OOK signaling waveforms are just 


s(t) = A. Pr,(t) cos ot s(t) = 0 [11] 


Our carrier-frequency condition f, = N./T, means that s,(t — kT,) = A, cos ot for 
any bit interval while, of course, so(t — kT,) = 0. Thus, a receiver with correlation 
detection simplifies to the form of Fig. 14.2-4, in which a local oscillator synchro- 
nized with the carrier provides the stored copy of s,(t). The bit sync signal actuates 
the sample-and-hold unit and resets the integrator. Both sync signals may be derived 
from a single source, thanks to the harmonic relationship between f, and r,. 

Now we use Eqs. (7) and (11) to obtain Ey = Ey) = O and 


tE _ AT, 
SEE 


so E, = E,/2 = A2T,/4. Setting the threshold at V = K(E, — Ej)/2 = KE, yields 
the minimum average error probability given by Eq. (9), namely 


P, = Q(VEJN,) = o( Vy.) [2] 


x(t) + noise (k+ DT, y(t) 
f S/H ^ 
kT, m 
| y 
KA, cos at t ! o—t 
i LI 


T, 2 

$ ATT, 
E, = 42| cos’ o, t dt = E + sinc 
0 





a 


Fp 








Bit sync 


Figure 14.2-4 Correlation receiver for OOK or BPSK. 
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Not surprisingly, the performance of coherent OOK is identical to unipolar base- 
band transmission. 

Better performance is achieved by coherent phase-reversal keying. Let the two 
phase shifts be 0 and zr radians, so 


5\(t) = A,pz(t) cos e.t — sot) = —s,(t) [13] 


The relation s(t) = —s,(t) defines antipodal signaling, analogous to polar base- 
band transmission. It quickly follows that 


E, = E, = Ep = A2 Ej = —E, 
SO E, = Eio = 2E, and 
P, = Q(V2EJN,) = Q( V2y,) [14] 


BPSK therefore gets by with 3 dB less signal energy than OOK, other factors being 
equal. 

Since so(t) = —s,(t), a coherent BPSK receiver requires only one matched fil- 
ter or correlator, just like OOK. But now V = 0 since E, = Ep, so the BPSK thresh- 
old level need not be readjusted if the received signal undergoes fading. Further- 
more, the constant envelope of BPSK makes it relatively invulnerable to nonlinear 
distortion. BPSK is therefore superior to OOK on several counts, and has the same 
spectral efficiency. We'll see next that BPSK is also superior to binary FSK. 

Consider binary FSK with frequency shift =f, and signaling waveforms 


5\(t) = A, p(t) cos 2a( f. + fa)t [15] 
Sot) = A. prt) cos 2n( fe — fa)t 
When f, + f; >> rj, E, = A2T,/2, whereas 
Ey, = E, sinc (4 fy/rp) [16] 


which depends on the frequency shift. If f, = r,/2, corresponding to Sunde’s FSK, 
then Eio = 0 and the error probability is the same as OOK. 

Some improvement is possible when phase discontinuities are allowed in x,(f), 
but E, — Ey = 1.22E, for any choice of f}. Hence, binary FSK does not provide 
any significant wideband noise reduction, and BPSK has an energy advantage of at 
least 10 log (2/1.22) ~ 2 dB. Additionally, an optimum FSK receiver is more com- 
plicated than Fig. 14.24. 


Suppose the optimum receiver for Sunde's FSK is implemented in the form of Fig. 
14.2-3a. Find and sketch the amplitude response of the two filters. 
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Timing and Synchronization 


Finally, we should give some attention to the timing and synchronization problems 
associated with optimum coherent detection. For this purpose, consider the band- 
pass signaling waveform and matched filter 


s(t) = Acpr(t) cos ost fT, = N, >} 
h(t) = Ks(T, — t) = KA. prt) cos ot 


When s(t) is applied to its matched filter, the resulting response is 





t=T 

z(t) = s(t)*h(t) ~ KEA( T | cos w,t [17] 
b 

where E = A?T,/2. The sketch of z(t) in Fig. 14.2—5 shows the expected maximum 

value z(T,) = KE, and the response for t > T, would be eliminated by discharging 

the filter after the sampling instant. 


But suppose there's a small timing error such that sampling actually occurs at 
t, = T,(1 + €). Then 


z(t) ^ KE cos 0, 0. = w,T,€ = 2vN,e 


so the timing error reduces the effective signal level by the factor cos 0,. Since 
lz, — zo? will be reduced by cos? 6,, while o? remains unchanged, the error proba- 


bility becomes 
E, RS Eio ) 
P, = Q| ./——— cos” 18 
e o( N, cos“ 0, [18] 


which follows from Eq. (9). As an example of the magnitude of this problem, take 
BPSK with y, = 8,r, = 2 kbps, and f, = 100 kHz; perfect timing gives 
P, = Q( v16) = 3 X 107°, while an error of just 0.3 percent of the bit interval 
results in 0, = 27(100/2) x 0.003 = 54° and P, = Q( V 16 cos? 54?) = 107°. 
These numbers illustrate why a bandpass matched filter is not a practical method for 
coherent detection. 








Figure 14.2-5 Response of bandpass matched filter. 
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A correlation detector like Fig. 14.2— has much less sensitivity to timing error, 
since the integrated output does not oscillate at the carrier frequency. Correlation 
detection is therefore used in most coherent binary systems. However, the local 
oscillator must be synchronized accurately with the carrier, and a phase synchro- 
nization error 9, again reduces the effective signal level by the factor cos 0,. 

In the case of BPSK, the carrier sync signal can be derived from x,(t) using 
techniques such as the Costas PLL system back in Fig. 7.3-4. Another approach 
known as phase-comparison detection is discussed in the next section, along with 
noncoherent detection of OOK and FSK. 


14.3 NONCOHERENT BINARY SYSTEMS 


Optimum coherent detection may not be essential if the signal is strong enough for 
adequate reliability with a less sophisticated receiver. A prime example of this situ- 
ation is digital transmission over voice telephone channels, which have relatively 
large signal-to-noise ratios dictated by analog performance standards. There are also 
applications in which it would be very difficult and expensive to carry out coherent 
detection. For instance, the propagation delay on some radio channels changes too 
rapidly to permit accurate tracking of the carrier phase at the receiver, and unsyn- 
chronized or noncoherent detection becomes the only viable recourse. 

Here we examine the suboptimum performance of noncoherent OOK and FSK 
systems that employ envelope detection to bypass the synchronization problems of 
coherent detection. We'll also look at differentially coherent PSK systems with 
phase-comparison detection. For all three cases we must first analyze the envelope 
of a sinusoid plus bandpass noise. 


Envelope of a Sinusoid Plus Bandpass Noise 


Consider the sinusoid A,cos (w,t + 0) plus gaussian bandpass noise n(f) with zero 
mean and variance o°. Using the quadrature-carrier expression 


n(t) = nj(t) cos (w,t + 0) — n,(t) sin (w,t + 0) 
we write the sum as 
Acos (wet + 0) + n(t) = A(t) cos [o,t + 0 + d(t)) 
where, at any instant f, 
n 
= V(A, + ny + n2 = — 1 
A (A, + n) + nj $ = arctan rs (1] 


We recall from Sect. 10.1 that the i and g noise components are independent RVs 

having the same distribution as n(r). Now we seek the PDF of the envelope A. 
Before plunging into the analysis, let’s speculate on the nature of A under 

extreme conditions. If A, = 0, then A reduces to the noise envelope A,, with the 


" 
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Rayleigh distribution 
An dog 
DIA m US Aged 2] 
c 


At the other extreme, if A, >> c, then A, will be large compared to the noise com- 
ponents most of the time, so 





A — A, VÀ + Qi n/A,) + (n? + n2)/A2 = A, + n; 


which implies that A will be approximately gaussian. 
For an arbitrary value of A,, we must perform a rectangular-to-polar conversion 


following the procedure that led to Eq. (10), Sect. 8.4. The joint PDF of A and œ 
then becomes 





A A? — 2A,.A cos $ + A? 
zep Ss Ts [3] 


A, = 
Pasl $) 270 20? 


for A = 0 and || = v. The term A cos @ in the exponent prevents us from factor- 
ing Eq. (3) as a product of the form p,(A)pg(¢), meaning that A and ó are not sta- 
tistically independent. The envelope PDF must therefore be found by integrating the 
joint PDF over the range of $, so 


A A? + A2N (7 A.A cos $ 
palA) = z €XXpi — 2 exp 2 do 
270 20 c 


gari 





Now we introduce the modified Bessel function of the first kind and order zero, 
defined by 


1 T 
Iv) & il exp (v cos à) db [4a] 
with the properties 
e^ v «Jj 
Iv) ~4_ e" [45] 
= v > 1 
V 2770 
We then have 
A 24 42) 1902 A.A 
palA) = =; e € +A2)/20 (Sr) A > 0 [5] 
c o 


which is called the Rician distribution. 


Although Eq. (5) has a formidable appearance, it easily simplifies under large- 
signal conditions to 


A 279 42 
DA Hae ee OO dise tel 
TAC” 
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Figure 14.3-1 PDFs for the envelope of a sinusoid plus bandpass noise. 


obtained from the large-v approximation in Eq. (4b). Since the exponential term 
dominates in Eq. (6), we have confirmed that the envelope PDF is essentially a 
gaussian curve with variance o? centered at A = A,. Figure 14.3-1 illustrates the 
transition of the envelope PDF from a Rayleigh curve to a gaussian curve as A, 
becomes large compared to o. l 


Noncoherent OOK 


Noncoherent on-off keying is intended to be a simple system. Usually the carrier 
and data are unsynchronized so, for an arbitrary bit interval KT, < t < (k + 1)T,, 
we write 
x(t) = Aca,prz (t — kT;)cos (w,t + 6) a,=0,1 . [7] 
The signaling energies are Ej = 0 and 
A?T, sin (2w,T, + 20) — sin20]  A?T, 
Ep, 2 H g . 08" sO usc 
2 2o, T, 2 
where we've assumed that f. => r, The average signal energy per bit is then 
E, = E,/2 ~= A2T,/4 since we'll continue to assume that 1s and Os are equally likely. 


The OOK receiver diagrammed in Fig. 14.3-2 consists of a BPF followed by an 
envelope detector and regenerator. The BPF is a matched filter with 


A(t) = KA, pr (t) cos w, t [8] 


which ignores the carrier phase 0. The envelope detector eliminates dependence on 
0 by tracing out the dashed line back in Fig. 14.2—5. Thus, when a, = 1, the peak 


signal component of the envelope y(t) is A, = KE). Let's take K = A JE, for con- 
venience, so that A; = A,. Then 


A2/ao? = 4E,/No = 4y, [9] 
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Figure 14.3-2 Noncoherent OOK receiver. 
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Figure 14.3-3 Conditional PDFs for noncoherent OOK. 


where o? is the variance of the bandpass noise at the input to the envelope detector, 
calculated from (7) using Eq. (45), Sect. 14.2. 

Now consider the conditional PDFs of the random variable Y = y(t,). When 
a, = 0, we have a sample value of the envelope of the noise alone; hence, py(y|Ho) 
is the Rayleigh function p, (y). When a, — 1, we have a sample value of the enve- 
lope of a sinusoid plus noise; hence, p,(y|H,) is the Rician function p,(y). Figure 
14.3-3 shows these two curves for the case of y, >> 1, so the Rician PDF has a 
nearly gaussian shape. The intersection point defines the optimum threshold, which 


turns out to be 
V go a Yp, 1 
opt 2 Yp 2 b 


But we no longer have symmetry with respect to the threshold and, consequently, 
Pe, P, when P, is minimum. 

Noncoherent OOK systems require y, >> 1 for reasonable performance, and 
the threshold is normally set at A,/2. The resulting error probabilities are 


A/2 
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p= (o9) à m o( 5) = otv) 


5 2c 
1 —yb/2 
x A Vo => 1 [10b] 
NYa 


where we've introduced the asymptotic approximation for Q( V y,) to bring out the 
fact that P, «€ P, when y, 22» 1. Finally, 


P, e KP, +P.) Me" + 0( V3] mi 
= fe Ye > 1 


which is plotted versus y, in Fig. 14.3—4 along with curves for other binary systems. 





Consider the BPF output z(t) = x(t)*h(r) when x(t) =A 


EXERCISE 14.3-1 -Pr{t)cos (a, t + @) and 
K = 2/A,T,. Show that, for 0 < t < T,, 





A.t l 
z(t) = [cos Ó cos wt — (sin p 
T, 


cos@\ , 
sin c£ 
wt 


Then find and sketch the envelope of z(f) assuming f. => rp 





Noncoherent FSK 


Although envelope detection seems an unlikely method for FSK, a reexamination of 
the waveform back in Fig. 14.1-1b reveals that binary FSK consists of two inter- 
leaved OOK signals with the same amplitude A, but different carrier frequencies, 
fh=ft+f, and fo f. — fa Accordingly, noncoherent detection can be imple- 
mented with a pair of bandpass filters and envelope detectors, arranged per Fig. 
14.3-5 where 


h(t) = KA, Ppr, (t) cos wt h(t) = KA.pr(t) cos wot [12] 
We'll take K = A,/E,, noting that E, = E, = Eo ~= A?T,/2. Then 
At/o? = 2EJN, = 2y, [13] 


where c? is the noise variance at the output of either filter. 

We'll also take the frequency spacing f, — fo = 2f, to be an integer multiple of 
rp, aS in Sunde’s FSK. This condition ensures that the BPFs effectively separate the 
two frequencies, and that the two bandpass noise waveforms are uncorrelated at the 
sampling instants. Thus, when a, = 1, the sampled output y,(t,) at the upper branch 
has the signal component A, = KE, = A, and a Rician distribution, whereas yo(t,) 
at the lower branch has a Rayleigh distribution—and vice versa when a, = 0. 


eee ae EP 
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Figure 14.3-4 Binary error probability curves. (a] Coherent BPSK; (b) DPSK; (c) coherent 
OOK or FSK; (d) noncoherent FSK; (e) noncoherent OOK. 
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| Figure 14.3-5 Noncoherent detection of binary FSK. 
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Regeneration is based on the envelope difference Y, — Yo = y,(t,) — yo(t,). 
Without resorting to conditional PDFs, we conclude from the symmetry of the 
receiver that the threshold should be set at V = 0, regardless of A,. It then follows 
that P, = P(Y, — Yo < O|[Hi) and P,, = P, = P,. Therefore, 


P, = P(Y% > Y,|Hi) 


= | exon] | osos) dvo dj; 


where the inner integral is the probability of the event Y) > Y, for a fixed value of 


y. Inserting the PDFs py(yo| Hi) = pao) and py(vi|Hi) = pa(y:) and perform- 
ing the inner integration yields 


S TP A 
P, — | JL o ~Qyi+ A220? 1 2) dy, 
0 [02 Oo 





Rather amazingly, this integral can be evaluated in closed form by letting A = V2y, 
and œ = A,/V2 so that 


P, = Pese E eg (92e? (2) dÀ 
0 
The integrand is now exactly the same function as the Rician PDF in Eq. (5), whose 
total area equals unity. Hence, our final result simply becomes 
p,- l -Rjaa? = low [14] 
2 2 
having used Eq. (13). 

A comparison of the performance curves for noncoherent FSK and OOK plot- 
ted in Fig. 14.3-4 reveals little difference except at small values of y,. However, 
FSK does have three advantages over OOK: constant modulated signal envelope, 
equal digit error probabilities, and fixed threshold level V = 0. These advantages 
usually justify the extra hardware needed for the FSK receiver. 


Differentially Coherent PSK 


Noncoherent detection of binary PSK would be impossible since the message infor- 
mation resides in the phase. Instead, the clever technique of phase-comparison detec- 
tion gets around the phase synchronization problems associated with coherent BPSK 
and provides much better performance than noncoherent OOK or FSK. The phase- 
comparison detector in Fig. 14.3-6 looks something like a correlation detector except 
that the local oscillator signal is replaced by the BPSK signal itself after a delay of T,. 
A BPF at the front end prevents excess noise from swamping the detector. 


Successful operation requires. f, to be an integer multiple of r,, as in coherent 
BPSK. We therefore write 


x(t) = Acpr(t — kT) cos (cxt + 0 + agr) [15] 
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a, = 0,1 KT, < t € (k+ 1)7, 
In the absense of noise, the phase-comparison product for the kth bit interval is 
x(t) X 2x(t — T,) = 242 cos (w,t + 0 + ar) 
X cos [w(t — T,) + 0 + a, ym] 
= Afcos[(a, — a4-1)] 
+ cos [2m,t + 20 + (a, + ay iym ]) 


where we've used the fact that c, 7, = 27 N.. Lowpass filtering then yields 


+A? âk = Gy| 
t) = : [16] 
3 ) e ak F aki 


so we have polar symmetry and the threshold should be set at V = 0. 


(0 A 
E ^ 
4 


V 
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Figure 14.3-6 Differentially coherent receiver for binary PSK. 


Since z(t,) only tells you whether a, differs from a,_,, a BPSK system with 
phase-comparison detection is called differentially coherent PSK (DPSK). Such 
systems generally include differential encoding at the transmitter, which makes it 
possible to regenerate the message bits directly from z(1,). Differential encoding 
starts with an arbitrary initial bit, say ay — 1. Subsequent bits are determined by the 
message sequence m, according to the rule: a, = a, , if m, = l,a, F a,., if 
m, = 0. Thus, z(t) = --A2 means that m, — 1 and 2(t,) = —A2 means that 
m, — 0. Figure 14.3-7 shows a logic circuit for differential encoding; this circuit 
implements the logic equation 


Gk zx G1 m, Akl m, [17] 


ak 





Figure 14.3-7 Logic circuit for differential encoding. 
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where the overbar stands for logical inversion. An example of differential encoding 
and phase-comparison detection (without noise) is given in Table 14.3-1. 

To analyze the performance of DPSK with noise, we'll assume that the BPF 
performs most of the noise filtering, like the BPFs in an FSK receiver. Hence, the 
carrier amplitude and noise variance at the BPF output are related by 


At/o* = 2E,/Ns = 2y, 


We’ll also exploit the symmetry and focus on the case when a, = a,_, = 0, so an 
error occurs if y(t,) < 0. 

Now let the delayed i and q noise components be denoted by n;(t) = n,(t — T,) 
and n,(t) = n,(t — Tj). The inputs to the multiplier during the kth bit interval are 
x(t) + n(t) = [A, + n(t)] cos (w.t + 6) —n,(t) sin(w@,t+ 0) and 2[x(t — Tp) + 
n(t) — T,)] = 2[A, + ni(t)] cos (at + 6) — 2 n;(t) sin (mt + 6). The LPF then 
removes the high-frequency terms from the product, leaving 

Y = y(t) = (A, + ni(A, + nj) + nan [18] 
where all four noise components are independent gaussian RVs with zero mean and 
variance c. 


Equation (18) has a quadratic form that can be simplified by a diagonalization 
process, resulting in 


with 
a= (Atata g-pg-eg [19b] 
and 
A 1 ' A 1 1 
Qi = 5 Qu + ni) pi = 5 — nj) [19d 


A^ 1 " ^ 1 f 
a, — 5 ("a + nj) p 5 0 — ng) 


Note that o; is a zero-mean gaussian RV with variance o? = (n? + nj?)/4 = 
20?/4 = g?/2; identical conclusions hold for the other i and q components of a and 
B. Therefore, œ has a Rician PDF given by Eq. (5) with o?/2 in place of o°, while B 
has a Rayleigh PDF given by Eq. (2) with o?/2 in place of o°. 





Table 14.3-1 

Input message 10110100 
Encoded message 1 1000 1 10 1 
Transmitted phase ww700 0 wna 0 
Phase-comparison sign +-++-+- - 
Regenerated message 101 10100 








14.3  Noncoherent Binary Systems 


Lastly, since « and £ are nonnegative, we can write the average error probabil- 
ity as 


P, = P(Y < Ola, = a,_;) = P(o? < B?) = P(B > a) 


and we’ve arrived at an expression equivalent to the one previously solved for non- 
coherent FSK. Substituting 7/2 for a? in Eq. (14) now gives our DPSK result 

E s 
The performance curves in Fig. 14.3-4 now show that DPSK has a 3-dB energy 
advantage over noncoherent binary systems and a penalty of less than 1 dB com- 
pared to coherent BPSK at P, = 107%. 

DPSK does not require the carrier phase synchronization essential for coherent 
PRK, but it does involve somewhat more hardware than noncoherent OOK or 
FSK—including differential encoding and carrier-frequency synchronization with 
r, at the transmitter. A minor annoyance is that DPSK errors tend to occur in groups 
of two (why?). 





Binary data is to be sent at the rate r, = 100 kbps over a channel with 60-dB trans- 
mission loss and noise density Ny = 10^? W/Hz at the receiver. What transmitted 
power S, is needed to get P, = 10? for various types of modulation and detection? 

To answer this question, we first write the received signal power as 
Sp = Exry = Noyor, = S7/L with L = 10$. Thus, 


Sr = LNoyory = 0.1y, 


Next, using the curves in Fig. 14.3-4 or our previous formulas for P,, we find the 
value of y, corresponding to the specified error probability and calculate Sy there- 
from. 

Table 14.3—2 summarizes the results. The systems have been listed here in order 
of increasing difficulty of implementation, bringing out the trade-off between signal 
power and hardware complexity. 


Table 14.3-2 

System Sp W 
Noncoherent OOK or FSK 1.26 
Differentially coherent PSK 0.62 


Coherent BPSK 0.48 
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Suppose the system in the previous example has a limitation on the peak envelope 
power, such that L A? < 2 watts at the transmitter. Find the resulting minimum error 
probability for noncoherent OOK and FSK and for DPSK. 


EXERCISE 14.3-2 








14.4 QUADRATURE-CARRIER AND M-ARY SYSTEMS 


This section investigates the performance of M-ary modulation systems with coher- 
ent or phase-comparison detection, usually in a quadrature-carrier configuration. 
Our primary motivation here is the increased modulation speed afforded by QAM 
and related quadrature-carrier methods, and by M-ary PSK and M-ary QAM modu- 
lation. These are the modulation types best suited to digital transmission on tele- 
phone lines and other bandwidth-limited channels. 

As in previous sections, we continue to assume independent equiprobable sym- 
bols and AWGN contamination. We also assume that M is a power of two, consistent 
with binary to M-ary data conversion. This assumption allows a practical compari- 
son of binary and M-ary systems. 


Quadrature-Carrier Systems 


We pointed out in Sect. 14.1 that both quadriphase PSK and keyed polar QAM are 
equivalent to the sum of two BPSK signals impressed on quadrature carriers. Here 
we'll adopt that viewpoint to analyze the performance of QPSK/QAM with coher- 
ent detection. Accordingly, let the source information be grouped into dibits repre- 
sented by 1,Q,. Each dibit corresponds to one symbol from a quaternary (M = 4) 
source or two successive bits from a binary source. In the latter case, which occurs 
more often in practice, the dibit rate is r = r,/2 and D = 1/r = 2T,. 

Coherent quadrature-carrier detection requires synchronized modulation, as dis- 
cussed in Sect. 14.2. Thus, for the kth dibit interval kD « t « (k + 1)D, we write 


x(t) = s(t — kD) — s(t — kD) [1a] 
with 
s(t) = AclkPolt) COS et I, = +] [1b] 
s(t) = A, Q.pp(t) sin Qt Qi GS 
Since f, must be harmonically related to r — 1/D, the signaling energy is 
(k+1)D 1 
|. xoa aud + ODD = ALD 
kD 
and we have 


E = 2E, E, = A2D/2 [2] 


where E is the energy per dibit or quaternary symbol. 


14.4 Quadrature Carrier and M-ary Systems 


From Eq. (1) and our prior study of coherent BPSK, it follows that the optimum 
quadrature-carrier receiver can be implemented with two correlation detectors 
arranged as in Fig. 14.4—1. Each correlator performs coherent binary detection, 
independent of the other. Hence, the average error probability per bit 1s 


Pre = Q(V2EJN, = Q( V2y,) [3] 


where the function Q( V2y,) denotes the area under the gaussian tail—not to be 
confused with Q symbolizing quadrature modulation. 

We see from Eq. (3) that coherent QPSK/QAM achieves the same bit error proba- 
bility as coherent BPSK. But recall that the transmission bandwidth for QPSK/QAM is 
Brera 
whereas BPSK requires By = rj. This means that the additional quadrature-carrier 
hardware allows you to cut the transmission bandwidth in half for a given bit rate or 
to double the bit rate for a given transmission bandwidth. The error probability 

remains unchanged in either case. 

Equation (3) and the bandwidth/hardware trade-off also hold for minimum- 
shift keying, whose i and q components illustrated back in Fig. 14.1-11b suggest 
quadrature-carrier detection. An MSK receiver has a structure like Fig. 14.4—1 mod- 
ified in accordance with the pulse shaping and staggering of the i and q components. 
There are only two significant differences between MSK and QPSK: (1) the MSK 
spectrum has a broader main lobe but smaller side lobes than the spectrum of QPSK 
with the same bit rate; (2) MSK is inherently binary frequency modulation, whereas 
QPSK can be viewed as either binary or quaternary phase modulation. 

When QPSK/QAM is used to transmit quaternary data, the output converter in 
Fig. 14.4—1 reconstructs quaternary symbols from the regenerated dibits. Since bit 
errors are independent, the probability of obtaining a correct symbol is 


P, = (1 — By 
The average error probability per symbol thus becomes 
P, = 1 - P, = 20( V E/N) — Q'(V EIN) (4] 
—2Q(VE/JN,) | E/Ny >> 1 


where E — 2E, represents the average symbol energy. 


qk 
Regen 
Data m 
converter 


| 







x(t) + noise 








t Or 
- KA, sin at 
Figure 14.4-1 Quvadrature-carrier receiver with correlation detectors. 
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xí) 4th-law BPF PLL 
device 4f. 


Figure 14.4-2 PLL system for carrier synchronization in a quadrature-carrier receiver. 








cos 4 wet cos (ut + Nm2) 


Various methods have been devised to generate the carrier sync signals neces- 
sary for coherent detection in quadrature-carrier receivers. Figure 14.4—2 shows a 
simple PLL system based on the fact that the fourth power of x,(t) contains a dis- 
crete frequency component at 4f.. However, since cos 4w,t = cos (4w, t + 2x NN), 
fourfold frequency division produces cos (w,t + Na/2) so the output has a fixed 
phase error of Nzr/2 with N being an integer whose value depends on the lock-in 
transient. A known preamble may be transmitted at the start of the message to per- 
mit phase adjustment, or differential encoding may be used to nullify the phase error 
effects. Another carrier sync system will be described in conjunction with M-ary 
PSK; additional methods are covered by Lindsey (1972). 

Phase-comparison detection is also possible in quadrature-carrier systems with 
differential encoding. From our study of DPSK in Sect. 14.3, you may correctly 
infer that differentially coherent QPSK (DQPSK) requires somewhat more signal 
energy than coherent QPSK to get a specified error probability. The difference turns 
out to be about 2.3 dB. 








EXERCISE 14.4-1 


Consider a QPSK signal like Eq. (1) written as x,(t) = A;cos (w,t + $,) with $, = 
m /4, 37/4, 57/4, 77/4. Show that x2(t) includes an unmodulated component at 4f,. 





M-ary PSK Systems 


Now let's extend our investigation of coherent quadrature-carrier detection to 
encompass M-ary PSK. The carrier is again synchronized with the modulation, and 
f. is harmonically related to the symbol rate r. We write the modulated signal for a 
given symbol interval as 


x(t) = s(t — kD) — st — kD) [5a] 
with 
s(t) = A, cos o ,pp(t) cos «t [55] 
s(t) = A, sin $ Polt) sin wt 
where 


ó,-72maJM a,=0,1,...,M—1 








14.4 Quadrature Carrier and M-ary Systems 














t k fi 
x(t) + noise a Phase $k 
~ regen 
j (k DD 
kD Yq 








- KA, sin wt 


Figure 14.4-3 Coherent Mary PSK receiver. 


from Eq. (13), Sect. 14.1, taking N = 0. The signaling energy per symbol then 
becomes 


E= 2 A%(cos? $, + sin? $9,)D = PAID [6] 
equivalent to E, = E/log, M if each symbol represents log; M binary digits. The 
transmission bandwidth requirement is By = r = r,/log; M, from our spectral 
analysis in Sect. 14.1. 

An optimum receiver for M-ary PSK can be modeled in the form of Fig. 14.43. 
We'll let K = AJ E so, in absence of noise, the quadrature correlators produce 
z(t,) = A. cos à, and z (tj) = A, sin hy from which $, = arctan z,/z;. 

When x,(t) is contaminated by noise, message symbol regeneration is based on 
the noisy samples 


y; ^ A, cos f, + nj y, = Ae Sin oy + n, 


in which the ; and q noise components are independent gaussian RVs with zero 
mean and variance 


| 
o? = K?ENy2 = A2N)/2E = Nor [7] 


The generator has M angular thresholds equispaced by 27r/M, as illustrated in Fig. 
14.44, and it selects the point from the signal constellation whose angle is closest 
to arctan y,/y;. 

The circular symmetry of Fig. 14.4—4, together with the symmetry of the noise 
PDFs, means that all phase angles have the same error probability. We'll therefore 
focus on the case of $, = 0,so 

Yq nq 
arctan — = arctan = 


Yi et n 





and we recognize ¢ as the phase of a sinusoid plus bandpass noise. Since no error 
results if |¢| < 2/M, the symbol error probability can be calculated using 
T/M 
P, = P(\d| > «/M) »1— | pe (o) de [8] 
-n/M 


for which we need the PDF of the phase $. 
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Figure 14.4-4 Decision thresholds for Mary PSK. 


The joint PDF for the envelope and phase of a sinusoid plus bandpass noise was 
given in Eq. (3), Sect. 14.3. The PDF of the phase alone is found by integrating the 
joint PDF over 0 = A < oo. A few manipulations lead to the awesome-looking 


expression 
1 iso Ag COS 9 A? sin? ó A, cos $ | 
= — eg Ad/2o* 4 eo apse ae = COS DO 
Pl?) 2n * =: exo( DE leu S [9] 

for —7 < $ < a. Under the large-signal condition A, => o, Eq. (9) simplifies to 
A, cos $ NUUAM 7 

palp) ~ ——— ec sn 9/26 die [10] 
Eres VE 


which, for small values of $, approximates a gaussian with @ = 0 and $? = a?/A?. 
Equation (10) is invalid for |@| > 7/2, but the probability of that event is small if 
A, >> o. Figure 14.4—5 depicts the transition of pg(@) from a uniform distribution 
when A, = 0 to a gaussian curve when A, becomes large compared to ø. (See Fig. 
14.3—1 for the corresponding transition of the envelope PDF.) 

We'll assume that A, >> o so we can use Eq. (10) to obtain the error probabil- 
ity of coherent M-ary PSK with M > 4. (We already have the results for M = 2 and 
4.) Inserting Eq. (10) with A2/o0* = 2E/Np into Eq. (8) gives 


Des [D E us h e PEIN Gin Yh de n 
: V 27r No 
—m/M 


De TE edi 
=] =| eld 
0 


where we’ve noted the even symmetry and made the change of variable 
à = V2E/Np sin à so L = V2E/N, sin (/M). But the integrand in Eq. (11) is a 
gaussian function, so P, ^ 1 — (1 — 2Q(D)] = 2Q(L). Hence, 


14.4 Quadrature Carrier and M-ary Systems 








Figure 14.4-5 — PDFs for the phase of a sinusoid plus bandpass noise. 


2E 7 
P, = 20( — sin? =) [12] 


which is our final result for the symbol error probability with M > 4. We'll discuss 
the equivalent bit error probability in our comparisons at the end of the chapter. 

Returning to the receiver in Fig. 14.4—3, the carrier sync signals can be derived 
from the Mth power of x,(t) using a modified version of Fig. 14.4-2. The more 
sophisticated decision-feedback PLL system in Fig. 14.4—6 uses the estimated phase 
by to generate a control signal u(t) that corrects any VCO phase error. The two 
delayors here simply account for the fact that ¢, is obtained at the end of the Ath 
symbol interval. 

If accurate carrier synchronization proves to be impractical, then differentially 
coherent detection may be used instead. The noise analysis is quite complicated, but 
Lindsey and Simon (1973) have obtained the simple approximation 


4E : 
Qm 20( xi iz) [13] 


which holds for E/N, => 1 with M = 4. We see from Eqs. (12) and (13) that M-ary 
DPSK achieves the same error probability as coherent PSK when the energy is 
increased by the factor 

(o sin? (ar/M) 

— 2sin (z/2M) 


This factor equals 2.3 dB for DQPSK (M = 4), as previously asserted, and it 
approaches 3 dB for M > 1. 
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Delay . 
D Ji 
sin by 
e| Lens 








Figure 14.4-6 M-ary PSK receiver with decision-feedback system for carrier synchronization. 





EXERCISE 14.4-2 


Derive Eq. (7) by replacing one of the correlation detectors in Fig. 14.4—3 with an 
equivalent BPF, as in Fig. 14.23. 





M-ary QAM Systems 


We can represent the source symbols by combining amplitude and phase modulation 
to form M-ary QAM. M-ary QAM is also called M-ary amplitude-phase keying 
(APK). It is useful for channels having limited bandwidth and provides lower error 
rates than other M-ary systems with keyed modulation operating at the same symbol 
rate. Here we'll study the class of M-ary QAM systems defined by square signal con- 
stellations, after a preliminary treatment of suppressed-carrier M-ary ASK. 

Consider M-ary ASK with synchronized modulation and suppressed carrier. 
Carrier-suppression is readily accomplished by applying a polar modulating signal. 
Thus, for the kth symbol interval, we write 


x(t) = A.L.pp(t = kD) COS oct [14a] 
where 
l= £1, +3,..., (M = 1) [14b] 
The transmission bandwidth is By ~ r, the same as M-ary PSK. 


An optimum coherent receiver consists of just one correlation detector, since 
there's no quadrature component, and regeneration is based on the noisy samples 


yi — Al; 2 nj 
The noise component is a zero-mean gaussian RV with variance o? = Nor, as in Eq. 
(7). Figure 14.4—7 shows the one-dimensional signal constellation and the corre- 
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Figure 14.4-7 Decision thresholds for ASK with M = 4. 


sponding M — 1 equispaced thresholds when M = 4. The symbol error probability 


for any even value of M is 
P = 2(1 - Lol ae ) [15] 
i M V Nor 


obtained by the same analysis used for polar M-ary baseband transmission in Sect. 
11.2. 

Suppose that two of these ASK signals are transmitted on the same channel via 
quadrature-carrier multiplexing, which requires no more bandwidth than one signal. 
Let the information come from an M-ary source with M = u? so the message can be 
converted into two j-ary digit streams, each having the same rate r. The perfor- 
mance of M-ary QAM fundamentally depends upon the w-ary error rate and neres 
fore will be superior to direct M-ary modulation with M > p. 

Figure 14.4—8a diagrams the structure of our M-ary QAM transmitter. The out- 
put signal for the kth symbol interval is 


x(t) = s(t — kD) ~ s(t — kD) [16a] 





with 
s(t) = A.I,pp(t) COS wt I, = see +3. eats +(u = 1) [16b] 
st) = Ac QkPpolt) sin ot QO, = £1, +3,..., +(u — 1) 


The average energy per M-ary symbol is 
loa. 1 
E= a AcE + Q7)D = 34s — 1)D [17] 


since [2 = Q? = (y? — 1)/3. 

Coherent QAM detection is performed by the receiver in Fig. 14.4-8b, whose 
quadrature correlators produce the sample values 

Jim Ael + ni YSA Op T ns 

We then have a square signal constellation and threshold pattern, illustrated in Fig. 
14.4-8c taking M = 4? = 16. Now let P denote the probability of error for J, or Q,, 
as given by Eq. (15) with M replaced by u = VM. The error probability per M-ary 
symbolis P, = 1 — (1 — PY and P, = 2P when P «& 1. Therefore, 


1 3E | 
Peed quu iE [18] 
( a) | (M — DN, 
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Figure 14.4-8 Mary QAM system. [a] Transmitter; {b} receiver; (c) square signal constella- 
tion and thresholds with M = 16. 


14.4 Quadrature Carrier and M-ary Systems 


in which we've inserted the average symbol energy from Eq. (17). 

Calculations using this result confirm the superior performance of M-ary QAM. 
By way of example, if M = 16 and E/N = 100, then P, = 4 x 3/4 x Q( V20) 
= 1.2 X 107°, whereas an equivalent PSK system with M = 16 would have 


P, = 2Q( V1.6) = 6 x 107. 


Comparison of Digital Modulation Systems 


A performance comparison of digital modulation systems should consider several 
factors, including: error probability, transmission bandwidth, spectral spillover, 
hardware requirements, and the differences between binary and M-ary signaling. To 
establish an equitable basis for comparison, we'll make the realistic assumption that 
the information comes from a binary source with bit rate 7,. This allows us to com- 
pare systems in terms of the modulation speed 7,/B, and the energy-to-noise ratio y; 
needed to get a specified error probability per bit. 

Our previous results for binary modulation systems apply directly to the com- 
parison at hand, especially the error probability curves back in Fig. 14.3-4. Table 
14.4—1 serves as a more abbreviated summary when y, is large enough to justify the 
applicable approximations. (Thus, in the case of noncoherent OOK, almost all the 
errors correspond to the carrier "off" state.) The table omits coherent OOK and 
FSK, which have little practical value, but it includes QAM and QPSK viewed as 
binary rather than quaternary modulation. This listing emphasizes the fact that dou- 
bled modulation speed goes hand-in-hand with coherent quadrature-carrier detec- 
tion. Also recall that minimizing spectral spillover requires staggered keyed modu- 
lation (MSK or OQPSK) or additional pulse shaping. 

Now consider M-ary transmission with symbol rate r and energy E per symbol. 
We'll take M = 2* and introduce the data-conversion factor 


K — log; M 


which equals the number of bits per M-ary symbol. The equivalent bit rate and 
energy are r, — Kr and E, — E/K,so 


Yo = E/ KNo 
The modulation speed of M-ary PSK or M-ary QAM is 
r,/Br = K [1 9] 


Table 14.4-1 
Modulation 


Summary of binary modulation systems 








Detection ry Br P,, 
OOK or FSK (f, = r,/2) Envelope 1 je 
DPSK Phase-comparison . 1 ze” 
BPSK Coherent 1 OV 2) 
MSK, QAM, or QPSK Coherent quadrature 2 Q(V2y;) 
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since By = r = r,/K. The error probability per bit is given by 
Pre ey PJK 


providing that the data converter employs a Gray code, as discussed in conjunction 
with Eq. (24), Sect. 11.2. After incorporating these adjustments in our previous 
expressions, we get the comparative results listed in Table 14.4-2. The quantity 
rj/ By is often referred to as bandwidth efficiency. 


Table 14.4-2 Summary of M-ary modulation systems with r,/B; = K = log; M 





Modulation Detection 


P be 
DPSK (M = 4) Phase-comparison quadrature > of | AKy, sin? =) 
co PU 2M 





2 
PSK (M = 8) Coherent quadrature K G | 2Ky, sin? z) 
QAM (K even) Coherent quadrature E To + lof x ) 
: K\ Vm) “NNM” 





All of the quadrature-carrier and M-ary systems increase modulation speed at the 
expense of error probability or signal energy. Suppose, for example, that you want to 
keep the error probability fixed at P,, ^ 10 ^ —a common standard for comparison 
purposes. The value of y, needed for different modulation systems with various mod- 
ulation speeds then can be calculated from our tabulated expressions. Figure 14.4—9 
depicts the results as plots of r;/ By versus y, in dB, and each point is labeled with the 
corresponding value of M. Clearly, you would choose QAM over PSK for r,/Br = 4 
with coherent detection. M-ary DPSK eliminates the caxrier-synchronization problems 
of coherent detection, but it requires at least 7 dB more energy than QAM for 
r,/Br = 4. 

As our final comparison, Table 14.4—3 combines M-ary data from Fig. 14.4-9 
and calculates values for binary systems with the same error probability. The various 
systems are listed here in order of increasing complexity to bring out the trade-offs 
between modulation speed, signal energy, and hardware expense. 

You should keep two points in mind when you examine this table. First, the 
numerical values correspond to ideal systems. The modulation speed of an actual 
system is typically about 80 percent of the theoretical value, and the required energy 
is at least 1-2 dB higher. Second, the characteristics of specific transmission chan- 
nels may impose additional considerations. In particular, rapidly changing transmis- 
sion delay prohibits coherent detection, while transmission nonlinearities dictate 
against the envelope modulation of OOK and M-ary QAM. 

Other factors not covered here include the effects of interference, fading, and 
delay distortion. These are discussed in an excellent paper by Oetting (1979), which 
also contains an extensive list of references. 


14.5  Trellis-Coded Modulation 


ry/By» bps/Hz 








8 10 12 14 16 18 20 22 24 
Vb dB 
Figure 14.4-9 Performance comparison of Mary modulation systems with Pee = 1074. 


Table 14.4-3 Comparison of digital modulation systems with Phe = 1074 
Modulation Detection rJBr y» dB 


OOK or FSK ( f, = r,/2) Envelope 1 12.3 
DPSK (M = 2) Phase-comparison 1 9.3 
DQPSK Phase-comparison quadrature 2 10.7 
BPSK Coherent 1 8.4 
MSK, QAM, or QPSK Coherent quadrature 2 8.4 
DPSK (M = 8) Phase-comparison quadrature 3 14.6 
PSK (M = 8) Coherent quadrature 3 11.8 
PSK (M = 16) Coherent quadrature 4 16.2 
QAM (M = 16) Coherent quadrature 4 12.2 





14.5 TRELLIS-CODED MODULATION 


Let's consider a standard voice-telephone line with a 3.2 kHz bandwidth and a S/N 
of 35 dB used in conjunction with a modem to transmit a digital message. Accord- 
ing to the Hartley-Shannon law, discussed in Sect. 16.3, the line's capacity, or mes- 
sage rate, should be C = 37.2 kbps. However, for a fixed P,,, practical modems 
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using conventional digital modulation methods such as M-ary PSK and QAM have 
only achieved rates of up to 9.6 kbps—nowhere near the Hartley-Shannon limit. The 
rate could be increased without requiring additional bandwidth by simply adding 
more points in the signal constellation. But since the euclidean distances between 
these points are decreased, P,e increases. Conventional error correction codes could 
be employed, but the added redundancy reduces the overall message rate. The newly 
developed turbo codes would also allow us to approach the Hartley-Shannon limit, 
but due to their iterative nature they have a relatively long /atency time which may 
not be acceptable. 

In order to achieve message rates closer to the Hartley-Shannon limit on band- 
limited channels such as telephone and cable TV lines, we now consider trellis- 
coded modulation (TCM) which was developed in the early 1980s by Gottfried 
Ungerboeck of IBM Zurich Research Laboratory (Ungerboeck, 1982, 1987). 





TCM enables coding gains of at least 7 dB without bandwidth expansion and has 
enabled much higher baud rates for telephone modems. The cost incurred for this 
improvement is slightly increased decoder complexity. 


TCM Basics 


We first consider the M-ary PSK system whose constellation is shown in Fig. 


2 IAE Lour 
ET E ar) [1] 


14.5-1. Recall, 





In this case, the distance squared between adjacent points is 
T 
$c acted 
dz, 4sin T [2] 


and thus we can express Eq. (1) as 


2 E 4 
Py, = =Q\,/— di (3] 
be 2 of 2No » 


14.5 Trellis Coded Modulation 
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Figure 14.5-1 PSK signal constellation. {a} M = 4; [b] M = 8. 
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Figure 14.5-2 Generic m/[m + 1) encoder-modulator for TCM. 


To explain TCM, we first look at a QPSK system that has two bits per symbol and 
a constellation diagram as shown in Fig. 14.5-1a. In this case xx, = 01 is mapped 
to 77/2 radians, xx; = 10 is mapped to 377/2 radians, and so on. As Fig. 14.5-1 and 
Eq. (2) show, after mapping into the signal constellation, the minimum distance 


squared between adjacent symbols is d2,,, = doo] = 2. We use this as a reference 
for M-ary PSK TCM systems. 

The generic TCM system of Fig. 14.5-2 expands m message inputs x,, X2,..., Xin 
to generate m + 1 signal outputs Y}, yo, ..., Ym+1 giving us M = 2"*! channel sym- 


bols that are then mapped to an M-ary signal constellation. The overall encoding rate 
is thus R = m/(m + 1). The TCM structure includes using an m/(m + 1) rate con- 
volutional encoder, where m is equal to number of coded message bits and m = m. 
This encoder expands m message bits into rt + 1 signal outputs. 

The system of Fig. 14.5—3a, uses an m/(m + 1) = 1/2, 4-state, convolutional 
encoder to encode message bit x, into two bits, y, and y}, while the system of 
Fig. 14.5-3b, uses an m/(m + 1) = 2/3, 8-state convolutional encoder to encode 
the two message bits x,, x; into three bits y,, y2, and y3. In both systems, bits y,, y», 
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Figure 14.5-3 B-ary PSK encoder for TCM. (a) 4-state, m = 2, m = 1 convolutional encoder; 
[b] 8-state, m = 2, m = 2 convolutional encoder. 


and y3 are then mapped into an 8-ary PSK signal constellation of Fig. 14.5-1b with 
y¥3¥2y,  O0I mapped to 77/4 radians, y4y5 y, = 010 mapped to 77/2, and so on. 





This is in contrast to conventional error control coding schemes that seek to maxi- 
mize Hamming distances. An example of an 8-PSK partitioning process is shown in 
Fig. 14.54. 

Consider the system of 14.5—3a and its corresponding trellis diagram of 
Fig. 14.5-5. From each node, we have M branches corresponding to M outputs. 
Note that the parallel transitions are caused by the uncoded message bits. Let's 
assume the message is a successive sequence of x;x, = 00 inputs which produces a 
corresponding sequence of y4 y; y, = 000 outputs as shown in the dashed line of 
Fig. 14.5-6. At the destination, the signal is decoded using a similar trellis structure 
and the Viterbi algorithm to determine the trellis path that most likely correlates 
with the received sequence. Therefore to minimize errors, we want to maximize the 
free distance between alternate sequences. 

Let's say due to noise, an error occurs so that during decoding the received 
sequence diverges from the correct path. However, inherent in the TCM design, only 
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010 dy - 2 sin 7/8 
011 001 
100 000 
101 1 
110 
010 P d Ny 
011 001 
100 000 djs N2 
101 111 
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001 011 
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Figure 14.5-4 Partitioning of an 8-PSK signal set. 
SOURCE: Ungerbroeck, 1982. 
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p" 
1 000 
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Figure 14.5-5 Trellis diagram for the encoder of Figure 14.5-3a. 
SOURCE: Ungerbroeck, 1982. 
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Current state Output Next state 


q^ 000 q" +1 000 gr +2 000 grt 3 





de e e 


Figure 14.5-6 Two possible error events for the encoder of Figure 14.5-3a. Dashed line is 


transmitted sequence; solid lines are alternate received sequences; bold line 
is most likely error. 


certain deviations are allowed; therefore it may take several transmission intervals 
until the signal path eventually remerges back to the correct node. This serves to 
increase the free distance between the correct and alternate paths. 

Two different error events are shown in the solid lines of Fig. 14.5-6. Deviations 
from the correct path are measured by their respective euclidean distances away from 
000 on the signal constellation. The shorter error path has a distance squared of 
d? = doo = 4. Another possible error path sequence has a distance squared 
Of d$ = dloo + déoo-sooi + dooo-so1g = 2 + 4 sin? m/8 + 2 = 4.586. There- 
fore, when an error event does occur, its distance will be at least d2;, = 4. There are 
other possible error paths, but these will have the same or greater euclidean dis- 
tances, and thus we select d2,,, = dee = min(d?, d? ... ). 

For TCM, we define coding gain as 

A (dni E Joa [4] 
a (dus/ E )uscodea 
with E' and E being the energy of the coded and uncoded signals respectively. If 
both the coded and uncoded signals have the same normalized energy levels then 


= (d ET ) coded 


(d2, ) uncoded 


and the coding gain is the same as the distance gain squared. With uncoded QPSK as 
our reference, the coding gain for the system of Fig. 14.5-3a is g = 4/2 = 2,013 dB. 

For increased coding gain, we go to the 8-state, m — 2 TCM system of 
Fig. 14.5-35 with its corresponding trellis diagram of Fig. 14.5-7. Because all the 
message bits are coded, there are no parallel transitions to the next state. As before, 
let's assume the correct signal is a sequence of yy; y, = 000’s as shown in the 
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Figure 14.5-7 Trellis diagram for the encoder of Figure 14.5-3b. 
SOURCE: Ungerbroeck, 1982. 


dashed line of Fig. 14.5-8 with the minimum distance error event shown in the solid 
lines. Again, when an error does occur causing the signal to stray from the correct 
path, because only certain transitions are allowed, the signal may take several tran- 
sitions before it remerges to the correct node. As we also stated before, other error 
paths exist, but they will have the same or larger distances than this one. For the sig- 
nal constellation distances of Fig. 14.5-1b, we get 


297 
di = (doo 110 + dioii sm d$oo 110) EX uL sin’ > + 2 = 4.586 


and the coding gain becomes g = 4.586/2 = 2.293 or 3.6 dB. 
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Current state Output Next state 


q" 000 q" +1 000 q^ -2 000 qr +3 





he e. 


Figure 14.5-8 Error event for the encoder of Figure 14.5-3b. Dashed line is transmitted 
sequence; solid line is received sequence containing an error. 


As we already stated, we achieve maximum coding gains by maximizing the 
free distance between alternate sequences. This is done by careful encoding and det- 
inition of allowable transitions in the trellis diagram. Ungerbroeck (1982) has stated 
the following rules for optimal signal assignment: 


1. Allsignals should occur with equal frequency. 


2. All parallel transitions in the trellis diagram should be with signals that have the 
maximum euclidean distance. (This is why in the case of the TCM system of 
Figs. 14.5—3a and 14.5—5, transitions occur with signals such as y4y5 y, = 000 
and 100 that have a euclidean distance of dog9_,199 = 2 instead of signals 


y3y¥2y, = 000 and 001 that only have a euclidean distance of dooo.,09i = 
2 sin 7/8 = 1.414.) 


3. All other signals entering or leaving a given state should have the next maxi- 
mum possible euclidean distance. 
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Table 14.5-1 Coding gains for PSK-TCM systems 


88~PsK/opsK» dB 


States m m=2 N min(m — co) 
4 1 3.01 1.0 
8 2 3.60 2.0 
16 2 4.13 = 2.3 
32 2 4.59 4.0 
64 2 5.01 = 5.3 
128 2 5.10 = 0.5 
256 2. 5.75 =~ 1.5 


SOURCE: Ungerbroeck, 1987. 


Table 14.5-2 Coding gains for QAM TCM systems 





816-0am/8-PsK> dB 1832-oam/16-0am dB 
States m m=3 m=4 N mnn — co) 
4 d 4.36 3.01 4 
8 2 5.33 3.98 16 
16 2 6.12 4.77 56 
32 2 6.12 4.77 16 
64 2 6.79 5.44 56 
128 2 7:33 6.02 344 
256 2 7.3] 6.02 44 
512 2 7.37 6.02 4 


SOURCE: Ungerbroeck, 1987. 
NOTE: 'The 32-QAM has a “cross”-shaped constellation pattern and is referred to by Ungerbroeck as 32CR. 


Tables 14.5-1 and 14.5-2 show the potential coding gains versus number of states. 
Again, as stated, this assumes optimal encoding and partitioning of the signal set. 

TCM can also be implemented with M-ary QAM. The potential coding gains of 
coded 16-QAM and 32-QAM, as compared to uncoded 8-PSK and 16-QAM respec- 
tively, are shown in Table 14.52. 





Error Probability for Uncoded QPSK Versus 8-PSK TCM EXAMPLE 14.5-1 


For high signal-to-noise ratios and unity signal energy, an approximate expression 


for the lower bound for the error probability per M-ary symbol is given by (Unger- 
broeck, 1982) 


P, = Nain OCW d2,,/2 No) l6] 
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where Nj, is the average number of sequences whose distance is dnin from the 
correct sequence. Note the factor of N,, in Eq. (6) assumes only the likely error 
paths will significantly affect P,. It does not take into account other longer 
sequences. For uncoded QPSK, we observed from Fig. 15.5-1a, that each signal 
point has two adjacent points that are spaced at dnin = "V2 and thus N44, = 2. For 
E,/ Ng = 9 dB = 7.94, the error probability is 


2 
P,=20 = 5 x10 
: ol, = 


Previously, it was determined for TCM with 4-states and 8-PSK that dain = 2. The 
corresponding trellis diagram of Fig. 14.5—6 shows that, at any given state, for a set 
of y3¥2y¥, = 000 sequences, there is only one path, whose distance is d, that devi- 
ates from the correct sequence and therefore N,;, = 1. Thus, we get 


ET | 4 = -5 
»,- of zu) 3 x 10 


We said that N nin is an average. This is because with some signal constellations the 
number of nearest neighbors to a given point depends on its location. For example, 
in square signal constellations, interior points will have more neighbors than the 
exterior points. 





Hard Versus Soft Decisions 


As stated earlier, the coding gains we obtained with TCM come at a cost of 
increased decoder complexity. In conventional binary systems such as the one in 
Fig. 14.2-1, the demodulator outputs either m = 0 or m = 1 so that the decoder 
makes a hard decision based on two possible values of m. Thus, a significant 
amount of information may be lost prior to decoding. On the other band, TCM uses 
a demodulator that outputs a value of m that is typically 3 bits in length; thus the 
TCM decoder makes a soft decision based on 2? = 8 levels of m. These additional 
levels give the decoder more information about the signal than possible with only 2 
levels of m. Soft decision systems, TCM or otherwise, where the demodulator out- 
puts 8 quantization levels will have gains of 2 dB over hard decision systems. 
Extending this to an analog system with an infinite number of quantization levels, 
the gain would be only 2.2 dB. Therefore, 





For more information on TCM, see Ungerbroeck (1982), Biglieri et al. (1991), 
and Schlegal (1997). 
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Table 14.5-3 Selected telephone line data modems 





Model Bit Rate Modulation Transmitting Frequencies (Hz) 
Bell 103A? 300 bps FSK (1,070-1,270)/(2,025-2,225)! 
Bell 212A? 300/1200 bps DPSK 1200/2400! 
V.32 9600 bps 16-QAM or 32-QAM 1800 
with TCM 
V.32bis 14.4 kbps 128-QAM with TCM 1800 
V.34 28.8 kbps 960-QAM with TCM 1800 
V.34.bis* 33.6 kbps 1664-QAM with TCM 1800 
V.90 56/33.6 kbps? PCM — 





SOURCE: Lewart, 1998; Forney et al., 1996. 
NOTES: 'Originating/answering. ?Similar to V.21. ?Similar to V.22. 4Similar to V.34-1996 or V.34+. 


556 kbps downstream (server to user}, 33.6 kbps upstream; requires conditioned line. 


Modems 


A common application of bandpass digital modulation is the voice telephone 
modem (modulator/demodulator). This device modulates baseband digital signal 
from computer or fax to be put on a voice telephone line and then vice versa with the 
demodulator. As we discussed in Sect. 12.2, modems are an alternative to DSLs. 
Table 14.5-3 displays a selected list of Bell and the ITU (International Telecommu- 
nications Union) modem standards along with their respective rates and modulation 
methods. As we stated earlier, the Shannon limit for standard voice grade telephone 
lines is 37 kbps. Improvements in modulation and coding have enabled modems so 
they can get relatively close to this limit. Most commercial modems also have a fall- 
back option, so that at the initial connection and even during the session the modem 
will test the telephone line’s S/N to set or adjust the modem’s data rate. Thus in the 
case of a V.34 modem, if the line’s S/N is severely degraded, the 28.8 kpbs data rate 
drops to 14.4 or 9.6 kbps. 

Other types of modems for computer communication include cable modems for 
communication via the cable-TV network, LAN modems, wireless modems, and 
cellular telephone modems. Cable-TV systems with their bandwidths of 300 MHz 
promise data rates in the Gbps range. 





14.6 PROBLEMS 


14.1-1* Find from Eq. (7) the average power x? and the carrier-frequency power P, of an 
M-ary ASK signal. Then form the ratio P,/x? and simplify for M = 2 and M > 1. 


14.1-2 Suppose a binary ASK signal consists of RZ rectangular pulses with duration T,/2, 
where r, = 1/T, << f.. (a) Find the equivalent lowpass spectrum, and sketch and 
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label G,(f£) for f > 0. (b) Sketch the signal representing the sequence 010110. 
Then find the ratio of the carrier-frequency power P, to the average power x2. 


Consider a binary ASK signal with raised-cosine pulse shaping so, from Example 


2.5-2 
1 at t T sinc 2 fr 
t =- 1 + cos|— iH — P oe Se 
n | Er c3 VT Y- QR) 
(a) Sketch the signal representing the sequence 010110 when 7 = 7/2. Then find 
the equivalent lowpass spectrum, and sketch and label G,(f ) for f > 0. 


(b) Redo part a with 7 = T}. 
The envelope and phase variations of a QAM signal are 


A(t) = A x) + x4()]? AC) = arctan [x,(¢)/x((¢)] 


(a) By considering the time interval kD < t < (k + 1)D, obtain expressions for 
A(t) and $(t) with a rectangular pulse shape pp(t). 


(b) Redo part a with an arbitrary pulse shape p(t) whose duration does not exceed D. 


Let a polar M-ary VSB signal have Nyquist pulse shaping per Eq. (6), Sect. 11.3. 
Find the equivalent lowpass spectrum before VSB filtering. Then sketch and label 
Gf ) for f > 0 when the filter has By < r. 


Before bandpass filtering, the i and q components of the OQPSK signal generated in 
Fig. 14.1—6 can be written as 


x(t) = È aup — 2kT,) x(t) = DS) azip(t — 2kT, ~ T) 
k 


where a, = (2A, — 1) is the polar sequence corresponding to the message bit 
sequence Ap and p(t) = II(t/2T;) for NRZ rectangular pulse shaping. 


(a) Sketch x(t) and x,(t) for the bit sequence 10011100. Use your sketch to draw 
the signal constellation and to confirm that the phase (f) = arctan [x,(r)/x{t) | 
never changes by more than +7r/2 rad. 


(b) Find the equivalent lowpass spectrum. 


Let A, B, C, denote the Gray-code binary words for the eight-phase PSK constella- 
tion in Fig. 14.1—5b. Construct a table listing A, B, C, and the corresponding values 
of I, and Q, expressed in terms of a = cos 7/8 and B = sin 7/8. Then write 
algebraic expressions for I, and Q, as functions of A}, B} C,, a, and B. Devise from 
these expressions the diagram of a quadrature-carrier transmitter to generate the 


PSK signal, given a serial-to-parallel converter that supplies the binary words in the 
inverted polar form a, = 1 — 2 A, etc. 


Suppose a binary FSK signal with discontinuous phase is generated by switching 
between two oscillators with outputs A, cos (27 fot + 65) and A, cos (27 fit + 01). 
Since the oscillators are unsynchronized, the FSK signal may be viewed as the inter- 
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leaved sum of two independent binary ASK signals. Use this approach to find, 


sketch, and label G(f ) for f > 0 when f, = f. — r,/2 and f, =f, + r2 with 
Je © r, Estimate Br by comparing your sketch with Figs. 14.1-2 and 14.1-8. 


14.1-9 Starting with p(t) in Eq. (17c), obtain both forms of |P( f )|? as given in Eq. (18b). 


| 14.1-10F Consider a binary FSK signal defined by Eq. (15) with M = 2, D = T,, and 
B Q4 = N/T, where N is an integer. Modify the procedure used in the text and the hint 
given in Exercise 14.1—3 to obtain x,t) and x,(r). Then show that 


[ G(f) = Hafs- *) : a(r+ 2j 


E 1 — Nr,/2 + Nr,/2 |? 
E Gf) = H vag 2 + (= sine I 
. 4r, Ty r, 
E which reduce to Sunde's FSK when N - 1. 
i 14.1-11 Use the FSK spectral expressions given in Prob. 14.1—10 to sketch and label Gf ) 


for f > 0 when N = 2 and N = 3. Compare with Fig. 14.1-8. 


14.1-12 An OQPSK signal with cosine pulse shaping has many similarities to MSK. In par- 
ticular, let the i and q components be as given in Prob. 14.1-6 but take 
p(t) = cos (ar, t/2) 1 (t/2T,). 


(a) Sketch x,(t) and x,(t) for the bit sequence 100010111. Use your sketch to draw 
the signal constellation and to find the phase (t) = arctan [x,(t)/x(t)] at 
t = kT,,0 S k x 7. Compare these with Fig. 14.1-11. 


(b) By considering an arbitrary interval 2kT, < t < (2k + 1)T,, confirm that the 
envelope A(t) = A,[x2(t) + x?(t)]'” is constant for all t. 


(c) Justify the assertion that G,,(f ) is identical to an MSK spectrum. 
14.1-13 Derive the q component of an MSK signal as given in Eq. (25). 


14.1-14* Consider a BPSK system for a bandlimited channel with By = 3000 Hz where the 
spectral envelope must be at least 30 dB below the maximum outside the channel. 
What is the maximum data rate r, to achieve this objective? 


14.1-15 Repeat Prob. 14.1—14 for (a) FSK, (b) MSK. 


14.2-1 Draw and label the block diagram of an optimum coherent BPSK receiver with 
matched filtering. 


142-2 Suppose an OOK signal has raised-cosine pulse shaping so that 
s(t) = A, sin? (mt/T;)pz (t) cos w,t 


Draw and label the diagram of an optimum coherent receiver using: (a) matched fil- 
tering; (b) correlation detection. 


14.2-3 Obtain an exact expression for E, when a binary FSK signal has f, = N,r, but fy is 
arbitrary. Simplify your result when N, — f; T, => 1. 
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Use Table T.4 to show that taking f; ~ 0.35r, yields the lowest possible error prob- 
ability for binary FSK with AWGN. Write the corresponding expression for P, in 
terms of y,. 


Draw the complete block diagram of an optimum receiver for Sunde's FSK. Use 
correlation detection and just one local oscillator whose frequency equals r,. 
Assume that a bit-sync signal has been extracted from x,(t) and that N, is known. 


With perfect synchronization, a certain BPSK system would have P, = 10 ?. Use 
Eq. (18) to find the condition on 6, so that P, < 10 *. 


Consider a BPSK receiver in the form of Fig. 14.2-4 with local-oscillator output 
KA, cos (c, t + 0,), where 0, is a synchronization error. Show that the signal com- 
ponent of y(7;) is reduced in magnitude by the factor cos 0,. 


Find the exact expression for z(£) in Eq. (17). Then take N, >> 1 to obtain the given 
approximation. 


Consider a BPSK signal with pilot carrier added for synchronization purposes, 
resulting in 


s\(t) = [A, cos wt + a A, cos (et + 9) |pr(t) 
So(t) = [—A. cos ext + œA, cos (vt + 8) ]pz. (t) 


Take 0 = 0 and show that an optimum coherent receiver with AWGN yields 
P, = Q| V2y,/(1 + 2%) ]. 


Do Prob. 14.2-9 with @ = —7/2. 


When the noise in a coherent binary system is gaussian but has a nonwhite power 
spectrum G,(f), the noise can be “whitened” by inserting at the front end of the 
receiver a filter with transfer function H,( f ) such that |H,,(f)|’G,(f) = No/2. The 
rest of the receiver must then be matched to the distorted signaling waveforms ‘s,(t) 
and S(t) at the output of the whitening filter. Furthermore, the duration of the unfil- 
tered waveforms s,(f) and so(t) must be reduced to ensure that the whitening filter 
does not introduce appreciable ISI. Apply these conditions to show from Eq. (9a) that 


(=) = J Sif) ~ Soff)? 
20 


max Ln 4G,(f ) 
where S,(f) = #[s,(t) ], etc. Hint: Recall that if v(t) and w(f) are real, then 


df 


—oco — 


| vw) a= | vaywonar- | vcowonar 
oo — 09 

For coherent binary FSK, show that p = sinc (4 fj/r;). 

Determine p that minimizes P, for a coherent binary FSK system. 


Determine A, required to achieve a P, = 10 ? for a channel with Ny = 107" 
W/Hz, and BPSK with (a) r, = 9.6 kpbs, (b) for r, = 28.8 kpbs. 
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Repeat Prob. 14.2~14 using coherent FSK. 


A noncoherent OOK system is to have P, < 10 ?. Obtain the corresponding 
bounds on y, and P,. 


Do Prob. 14.3-1 with P, < 1075, 
Repeat Prob. 14.2-14 using noncoherent FSK. 


Determine P, for a channel using Sunde's FSK with (S/N) = 12 dB, r, = 14.4 
kbps, and (a) coherent detection, (b) noncoherent detection. 


Obtain Eq. (9) from Eq. (8) with K = A,/E;. 

Suppose the OOK receiver in Fig. 14.3-2 has a simple BPF with 
H(f) =[1+ j2(f — £.)/B] ! for f 0, where 2r, x B < f.. Assuming that 
f. = ry and y, >> 1, show that the signal energy must be increased by at least 5 
dB get the same error probability as an incoherent receiver with a matched filter. 


Consider a noncoherent system with a trinary ASK signal defined by Eq. (7) with 
a, = 0, 1, 2. Let E be the average energy per symbol. Develop an expression similar 
to Eq. (11) for the error probability. 

A binary transmission system with Sy = 200 mW, L = 90dB, and Ny = 10 P 
W/Hz is to have P, < 10 ^. Find the maximum allowable bit rate using: (a) nonco- 
herent FSK; (b) DPSK; (c) coherent BPSK. 

Do Prob. 14.3-8 with P, x 107°. 

A binary transmission system with phase modulation is to have P, - 10 *. Use Eq. 
(18), Sect. 14.2, to find the condition on the synchronization error 0, such that 
BPSK will require less signal energy than DPSK. 

Do Prob. 14.3-10 with P, = 107°. 

Derive the joint PDF in Eq. (3) by rectangular-to-polar conversion, starting with 
x = Å, t n; and y = ny. 

Binary data is to be transmitted at the rate r, = 500 kbps on a radio channel having 
400-kHz bandwidth. 


(a) Specify the modulation method that minimizes signal energy, and calculate y, in 
dB needed to get P,e = 10 6. 


(b) Repeat part a with the additional constraint that coherent detection is not practi- 
cal for the channel in question. 


Do Prob. 14.4—1 with r, = 1 Mbps. 


Binary data is to be transmitted at the rate r, — 800 kbps on a radio channel having 
250-kHz bandwidth. 


(a) Specify the modulation method that minimizes signal energy, and calculate y, in 
dB needed to get P,, = 10 5. 
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(b) Repeat part a with the additional constraint that channel nonlinearities call for a 
constant envelope signal. 


Do Prob. 14.453 with r, = 1.2 Mbps. 


Let the VCO output in Fig. 14.4-6 be 2 cos (w,t + 0,). In absence of noise, show 
that the control voltage u(t) will be proportional to sin 6,. 


Suppose an M-ary QAM system with M = 16 is converted to DPSK to allow phase- 
comparison detection. By what factor must the symbol energy be increased to keep 
the error probability essentially unchanged? 


Suppose a PSK system with M >> 1 is converted to M-ary QAM. By what factor 
can the symbol energy be reduced, while keeping the error probability essentially 
unchanged? 


Obtain the phase PDF given in Eq. (9) from the joint PDF in Eq. (3), Sect. 14.3. 
Hint: Use the change of variable A = (A — A, cos $)/o. 


Using the technique given in Fig. 14.4—2, design a system that will create the carrier 
reference needed for a M-ary PSK receiver. 


Generalize the design of Fig. 14.4—2 to enable the creation of M-reference signals 
for an M-ary PSK receiver. 


For y, = 13 dB, calculate P, for (a) FSK (noncoherent), (b) BPSK, (c) 64-PSK, (d) 
64-QAM. 


A QPSK system has a P, = 10 ?. What is the new P, if we employed TCM with 
m = m = 2 and 8 states? With TCM, does the output symbol rate change? 


Partition a 16-QAM signal constellation in a similar way that was done for the 
8-PSK constellation of Fig. 14.5—4 to maximize the distance between signal points. 
If the original minimum distances between adjacent points are unity, show the new 
minimum distances for each successive partition. 


Given the system of Figs. 14.5—-35 and 14.5—7, with initial state of a, determine the 
output sequence y, y; y, for an input sequence of x;x, = 0001 1001 11 00. 


For the system of Fig. 14.5-7, what is the distance between paths (0, 2, 4, 2) and (6, 
1, 3, 0)? 

Ungerbroeck (1982) increases the coding gain of the m = m = 2, 8-PSK TCM sys- 
tem of Fig. 14.5—7 by adding the following new states and corresponding output 
symbols: (i: 4062, j: 5173, k: 0426, I: 1537, m: 6240, n: 7351, o: 2604, p: 3715). Fol- 
lowing the pattern of Fig. 14.5—7, construct the new trellis diagram and show that 
Es-pskjgrsk 7 4.13 dB. 











CHAPTER OUTLINE 


chapter 


15 


spread Spectrum Systems 


Direct Sequence Spread Spectrum 

DSS Signals DSS Performance in the Presence of Interference Multiple Access 
Frequency Hop Spread Spectrum 

FH-SS Signals FH-SS Performance in the Presence of Interference 

Coding 

Synchronization 

Acquisition Tracking 

Wireless Telephone Systems 


Cellular Telephone Systems Personal Communications Systems 


671 





672 CHAPTER 15 e Spread Spectrum Systems 


Jo prior to World War Il, Hedy Lamarr, a wellknown actress and political refugee from Austria, struck up a con- 


versation with music composer George Antheil that led to a scheme to control armed torpedoes over long distances. 
The technique was immune to enemy jamming and detection. Instead of a conventional guidance system consisting of 
a single frequency signal that could easily be detected or jammed, their signal would hop from one frequency to 
another in a pseudorandom fashion known only to an authorized receiver (i.e., the torpedo}. This would cause the 
transmitted spectrum to be spread over a range much greater than the message bandwidth. Thus, frequency-hopping 
spread spectrum (FH-SS] was born and eventually patented by Lamarr and Antheil. 

Spread spectrum (SS] is similar to angle modulation in that special techniques spread the transmitted signal over 
a frequency range much greater than the message bandwidth. The spreading combats strong interference and pre- 
vents casual eavesdropping by unauthorized receivers. In addition to FH-SS, there is also directsequence spread- 
spectrum [DSS] based on a direct spreading technique in which the message spectrum is spread by multiplying the 
signal by a wideband pseudonoise (PN) sequence. 

We begin our study of spread-spectrum systems by defining direct sequence and frequency hopping systems 
and then examining their properties in the presence of broadband noise, single- and multipletone jammers as well as 
other SS signals. We'll then consider the generation of PN codes that have high values of autocorrelation between 
identical codes (so authorized users can easily communicate} and low values of crosscorrelation between different 
codes [to minimize interference by outsiders). Next we examine the method of code-division-multiple-access (CDMA) 


in which several users have different PN codes but share a single RF channel. Finally, we discuss synchronization and 
wireless telephone systems. 


OBJECTIVES 





After studying this chapter and working the exercises, you should be able to do each of the following: 


1. Describe the operation of DSS and FH-SS systems (Sects. 15.1 and 15.2). 


2. Calculate probability of error for DSS systems under single-tone jamming, broadband noise, and multiple-user 
conditions (Sect. 15.1). 

3. Calculate probability of error for FH-SS for single- and multiple-tone jamming, narrowband and wideband noise 
conditions, and muliple-user conditions (Sect. 15.2). 

4. Design and analyze code generators that produce spreading codes with high autocorrelation and low crosscorre- 
lation values (Sect. 15.3). 

5. Describe how a SS can be used for distance measurement (Sect. 15.3). 

6. Describe SS receiver synchronization and calculate the average time it takes to achieve synchronization 
(Sect. 15.4). 

7. 


Describe the differences between conventional cellular phone and the personal communications systems (PCS) 
(Sect. 15.5). 








15.1 DIRECT SEQUENCE SPREAD SPECTRUM 


DSS is similar to FM in that the modulation scheme causes the transmitted mes- 
sage’s frequency content to be greatly spread out over the spectrum. The difference 
is that with FM, the message causes the spectrum spreading, whereas with DSS, a 
pseudorandom number generator causes the spreading. 
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15.1 Direct Sequence Spread Spectrum 


DSS Signals 


A DSS system and its associated spectra are illustrated in Fig. 15.1-1, where the 
message x(t) is multiplied by a wideband PN waveform c(t) prior to modulation 
resulting in 


x(t) = x(t)c(t) n 


Multiplying by c(t) effectively masks the message and spreads tbe spectrum of the 
modulated signal. The spread signal X(t) can then be modulated by a balanced mod- 
ulator (or a multiplier) to produce a DSB signal. If x(t) had values of +1 that repre- 
sented a digital message, the output from the DSB modulator would be a BPSK 
(PRK) signal. 

Let's look at this more closely. The PN generator produces a pseudorandom 
binary wave c(t), illustrated in more detail in Fig. 15.1~2, consisting of rectangular 
pulses called chips (CPS). Each chip has a duration of T, and an amplitude of +1 so 
that c?(r) =1—an essential condition for message recovery. To facilitate analysis, 
we'll assume that c(t) has the same properties as the random digital wave in Exam- 
ple 9.1-3 when D = T, a, = €X1ando? = c? = 1. Thus, from our previous studies, 





R{r) = MA(t/T,) and GÁf) = T, sinc? L [2] 
which are sketched in Fig. 15.1-2. The parameter 
w, = ES 
Te 
x(Dn x(t) = x(t) cos wt 
x(t) Modulation 
c(t) 
PN COS (wf) 
gen 
GLF) GLI) 


Gf) 
f fe Š Wo è fe fct W, 
-W. W: t z f 
-W, W, 


Figure 15.1-1 DSS transmitter system and spectra. 
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RW 
1 
n T 
~T, T, 
(b) 
Figure 15.1-2 Pseudrandom binary wave. (a] Waveform; (b) autocorrelation and power 
spectrum. 


serves as a measure of the PN bandwidth. Next consider the “chipped message” x(t) = 
x(t)c(t). Treating x(t) as the output of an ergodic information process independent of 
c(t) we have 


xi = E[xXt)et)] = x! = S, 
Recall further that multiplying independent random signals corresponds to multi- 


plying their autocorrelation functions and convolving their power spectra. For clar- 
ity we denote the message bandwidth by W, such that Gf) ~ 0 for |f| > W, and 


Golf) = ef = | | 09647 A) dà 


But effective spectral spreading calls for W, => W, in which case Gif — A) = GAJ) 
over |A| = W,- Therefore 


W. 
Gf) | | G,(A) a| GÁf) = S,GÁf) [3] 
-w, 

and we conclude from Eq. (3) and Fig. 15.1—1 that x(t) has a spread spectrum 
whose bandwidth essentially equals the PN bandwidth W,. With practical systems, 
the bandwidth expansion factor W,/W, can range from 10 to 10,000 (10 to 40 
dB). As will be shown later, the higher this ratio the better the system's immunity to 
interference. 


DSB or DPSK modulation produces a transmitted signal proportional to x(t) cos 
w,t requiring a bandwidth B7 => W, 








15.1 Direct Sequence Spread Spectrum 


z(t) 
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xt) = X() cos wet x(t) + Zp) 
Demodulation G———- LPF 
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Sp = S. = 2Sp 
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Figure 15.1-3 DSS receiver system and the effects of a single-tone jammer. 





We’ll analyze the system’s performance taking unit-amplitude carrier at the 
receiver so Sg = 4x? = $8, and we’ll let z(t) stand for additive noise or interference 
with in-phase component z(t). Synchronous detection after bandpass filtering yields 
y(t) = x(t) + z(t) which is multiplied by a locally generated PN wave to get 


y(t) = [x(¢) + z(0]c(t) [4] 


= x(t)e t) + (e(t) = x(t) + Z1) 


Notice that this multiplication spreads the spectrum of z;(t) but de-spreads x(t) and 
recovers x(f) assuming near perfect synchronization of the local PN generator. 
Final lowpass filtering removes the out-of-band portion of the Z,(t), leaving yp(f) = 
x(t) + zp(t) with output signal power Sp = S, = 2S4. Our next step is to find the 
contaminating power zj at the output. 

When z(t) stands for white noise n(t), the in-phase component n;(t) has the low- 
pass power spectrum back in Fig. 10.1—3 and 


R,{7) E S IG. T NoBr sinc (BrT) 
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x(t) + z(t) x(t) + z(D (k+ DT, yt 
i) SH 





T, m 


c(t) E 


PN 
gen 


Figure 15.1-4 DSS correlation receiver for BPSK. 


The autocorrelation of the chipped lowpass noise 7,(r) equals the product R,(7)R,(7). 
Since R,(7) becomes quite small for |r] = 1/Br << 1/W, whereas R(T) ^ 1 for 
|r| «& T, = 1/W,, we conclude that Rz; (7) ^ R,(7) and hence 


Galf) = G,,(f) = NolI(f/Br) 
The output noise power from the lowpass filter is then Np = 2NọoW,, so 
(S/N)p = 2SR/2N W; = Sp/NoW, [5] 


Comparing this result with Eq. (45), Sect. 10.2, confirms that spread spectrum with 
synchronous detection has the same noise performance as a conventional DSB system. 

Similarly, if our message is digital and sent via BPSK we can use the correla- 
tion detector of Fig. 15.1—4. Thus, in the presence of white noise, the probability of 
error would be 


P, = O(V2E,/No) [6] 


DSS Performance in Presence of Interference 
Fig. 15.1-3 also illustrates the effects of a single-tone jammer with DSS system. Let 
z(f) stand for an interfering sinusoid or CW jamming signal, say 

z(t) ^ V2J cos [(w, + w,)t + 0] 


with average power z= Jat frequency f, + f. Then the in-phase component is 
z(t) = V2J cos (w,t + 0) so that 


G,U) = 50 — A) + 80 2] n 


Multiplication by c(t) spreads this spectrum so that G;(f) = G,(f)*G{f), a 
routine convolution with impulses. Because c(t) is relatively broadband, Zt) 
approximates another broadband noise source with power spectral density J/W,. If 
|A| << W,, then the corresponding upper bound on the output jamming power is 


Ta W. J 
z2 = | Gf) df= 2W, [8a] 


— WwW, 








15.1 Direct Sequence Spread Spectrum 677 


and the signal-to-jammung ratio becomes 


S W, Sp 
2j ee [85] 
JJ) W.J 


The single-tone jammer spectra has been spread and thus, relative to the output sig- 
nal power, has been reduced by a factor W,/W, << 1. This reduction is illustrated in 
the output spectra of Fig. 15.1-3. The bandwidth expansion ratio W,/W, is also 
called the process gain (Pg) or 


Pg = WW, [9] 


which is a measure of a system's immunity to interference. It should be observed 
that the larger the Pg, the better the system's immunity to interference. 

We can then treat the jammer as just another source of noise, and in the case of 
digital information sent via BPSK the probability of error would then be 


P, = O(V2E,/N,) [10] 


where N, = J/W.. If the channel is corrupted by both broadband white noise and a 
CW jammer then 


P, = O| V2E,/(M + Ny) | ul 


If we substitute the received power and jammer power terms of Sp = E,r,and 
N, = J/W, respectively into Eq. (10) we have 


u 2WAr, 
py oly 7S ) [12] 


Substituting r, = W,, and Eq. (9), Eq. (12) becomes 


AT R == [13] 
e= Q ELA 


Now let's say we specify a minimum P, value for Eq. (10). Combining with 
Eq. [13] and converting to dB, we have 


10 log (J/Sr) = 10 log (Pg) — 10 log (E,/N;) [14] 


The term 10 log (7/55) is called the jamming margin and is used as a measure of a 
system's ability to operate in the presence of interference. If, in a given system, we 
specify a minimum P, or minimum E,/N, ratio and a relatively large Pg, then the 
system will have margin against interference. 








A DSS-BPSK system has r, = 3 kbps, Nọ = 107!?, and is received with P, = 107?. EXAMPLE 15.1-1 
Let's calculate the Pg needed for the system to achieve P, = 10^ in the presence 
of a single-tone jammer whose received power is 10 times larger than the correct 
signal. Using Table T.6 and Eq. (6) we get 2E,/Ng = 36 or E, = 1.8 x 107). With 
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Sp = Eyn = 5.4 X 1078, then J = 10S, = 5.4 X 1077 W. If with a jammer, 
P, = 10? then, using Eq. (11) and Table T.6, N; = 1.48 X 107? and N, = J/W» 
W, = 3.6 X 106, If W, = r, then Pg = W,/W, = 1200. 





EXERCISE 15.1-1 


Given the above problem specifications, what is the jamming margin if Pg — 10,000? 





Multiple Access 


Finally, if we are sharing the channel with M — 1 other spread spectrum users—as is 
the case with code division multiple access (CDM A)—each one would have their 
own unique spreading code and arrival times at the receiver and thus the interference 
term becomes, 
M-1 
Z(t) = Y, AnXm(t — tn)Cm(t — tm) COS Om [15] 
m=1 


where Am, C(t), tm, and 0,, denote the signal amplitude, spreading code, time delay, 
and phase, respectively, of the mth user. Thus 


M-1 
H(t) = X(t) + Y, Asxs(t — tn) C(t — tm) cos Om [16] 
m-1 


If we assume each of the other users has identical signal strengths of unit value, then 
after despreading, Eq. (16) becomes, 


y(t) = x(t) + bs Xml — tm)CmlÉ — tm) COS s. c(t) [17] 


m=] 


In the case of BPSK, the output of the correlation receiver would be 


M-1 (k+)T, 
x(t.) + >) | cos Om | Xt ty Out — tm)C(t) dt [18a] 
m=] 


“kT, 


where z(t,) is the cumulative interference of the additional M — 1 CDMA users. 
Notice that since x,,() = +1, the integration term in Eq. (15a) becomes the cross- 
correlation between the desired and the interferer's PN codes. Therefore, minimiz- 
ing the crosscorrelation between spreading codes minimizes the interference 
between CDMA users. Ideally, each PN code would be chosen to be orthogonal to 
the other, thereby making z(t,) = 0. Unfortunately, as will be shown later, with 
practical systems this is not possible. 
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Pursley (1977) and Lathi (1998) derive an analytical expression for the bit error 
probability for M users of a CDMA channel corrupted by white noise. These show 
that if each of the M users have identical signal strengths, then 





P, = Q(/V(M — 1)/GPg) + Ny2E,) [19] 


Note that if the channel has only one user (i.e., M = 1), Eq. (19) reduces to Eq. (6). 


Conversely, if the channel is noiseless but contains other users, the error probability 
would be 


P, = O(V3Pg/(M — 1)) [20] 


Thus, even if the channel is noiseless, the error probability is still nonzero if it con- 
tains other users. 





This has regulatory implications. In the past, FCC regulations regarding maximum 
power levels were dictated by crosstalk considerations. In today’s SS world, addi- 
tional users merely decrease the other’s signal-to-noise ratio. 





30 dB. How many addi- EXERCISE 15.1-2 


A single-user DSS-BPSK system has P, = 107? and Pg = 
tional users can this system support if we allow P, = 1075? 








15.2 FREQUENCY HOP SPREAD SPECTRUM 


As previously noted, the wider the spread spectrum bandwidth, the more the jammer 
has to increase its power to be effective. However, the practical limitations of PN 
sequence generation hardware imposes a constraint on bandwidth spreading and 
therefore the processing gain. To enable even larger processing gains, the PN gener- 
ator can drive a frequency synthesizer that produces a wideband sequence of fre- 
quencies that can cause the data-modulated carrier to hop from one frequency to 
another. This process is called frequency hopping spread spectrum (FH-SS). 
Because the message is spread out over numerous carrier frequencies, the jammer 
has a reduced probability of hitting any one in particular. Or the jammer has to 
spread its power over a wider frequency range in order to be effective. 
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Figure 15.2-1 FH-SS System. (a) Transmitter; [b} receiver. 


FH-SS Signals 


Frequency hopping SS is shown in Fig. 15.2-1 and works as follows. The message is 
usually M-ary FSK modulated to some carrier frequency f., although some systems 
use BPSK. The modulated message is then mixed with the output of a frequency syn- 
thesizer. The frequency synthesizer's output is one of Y — 2* values, where k equals 
the number of outputs from the PN generator. The BPF selects the sum term from the 
mixer for transmission on the channel. The receiver in part (b) of the figure is the 
reverse of this process. Because of practical difficulties in maintaining phase coher- 
ence, most systems use noncoherent detection such as an envelope detector. 

There are two types of FH-SS systems. In slow hop SS one or more message 
symbols are transmitted per hop: in fast hop SS there are two or more frequency 
hops per message symbol. With slow hop SS, the receiver demodulates the signal 
like any other M-ary FSK signal. However, with fast hop SS there are several hops 
per symbol, so the detector determines the value based on either a majority vote or 
some decision rule such as maximum likelihood. 
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Figure 15.2-2 Output frequency versus data input for slow hop FH-SS system of Example 


15.2-1. 





quency hop and has a PN generator with k = 3 outputs. For a given binary message 
sequence, the spectral output is illustrated in Fig. 15.2-2. With k = 3 the system can 
hop to Y = 2* = 8 different carrier frequencies. For each hop the output frequency is 
Sc, + fa where f. is the carrier frequency at hop i and f; is the frequency shift from a 
particular carrier frequency. The number of frequencies clustered around each f. is 
equal to the number of symbols/hop. If the bit rate is r,, then from Eq. (16) of Sect. 
14.1 we have f, = r,/2. With a maximum frequency of f. + f, and a minimum fre- 
quency of f,, — f; then the amount of spreading or transmission bandwidth is By = 


W, = 8r,. From Sect. 15.1 we defined process gain as Pg = W/W, If W, =r, then 
with FH-SS we have 


paso n 





Consider a fast hop SS system with binary FSK, two hops per symbol, and a PN 
generator with outputs with the same binary message of Example 15.2-1. The mes- 
sage is transmitted using the following PN sequence: (010, 110, 101, 100, 000, 101, 


Consider a slow hop SS system with binary FSK that transmits two symbols per fre- 
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EXERCISE 15.2-1 
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011, 001, 001, 111, 011, 001, 110, 101, 101, 001, 110, 001, 011, 111, 100, 000, 110, 
110). In a format similar to that done in Example 15.2-1, plot the output frequen- 
cies for the input message. 





FH-SS Performance in the Presence of Interference 


We now want to consider the performance of a FH-SS system with respect to bit 
errors. There are several types of interference or noise to deal with: white noise of 
barrage jamming, single-tone jamming, partial-band jamming, or interference due 
to multiple FH-SS users on the same band as occurs with CDMA systems. Several 
of these are illustrated in Fig. 15.2-3. 

Slow hop SS is the most susceptible to jamming because one or more symbols 
are being transmitted at a particular frequency. However, if the hop period is shorter 
than the transit time between the jammer and user's transmitter/receiver, then by the 
time the jammer has decided which frequency to jam, the transmitter has already 
hopped to another frequency. With M-ary FSK and noncoherent detection, the bit 
error probability in the presence of white noise is 


P, = dg MN [2] 


If a jammer has the power spectral density of Fig. 15.2-3(a) with N; the power 
level over the entire bandpass of the system, the jammer appears as white noise and 
Eq. (2) becomes 


p, = ber EMANEN] [3] 
where N; = J/W.. 

Figure 15.2—3(b) illustrates partial-band jamming with A the fraction of the 

band being jammed. The quantity N,/A is equal to the jammer’s PSD and has a 


bandwidth of AW,. With A also equal to the probability of being jammed and 1 — A 
equal to the probability of not being jammed then, using the chain rule, we have 


P, = P(e|jammed’)P( jammed’) + P(e|jammed)P( jammed) 
The probability of error for partial-band jamming of FH-SS would then be 


pP = 1-A g Eo! 2No Ay A Blot N/A) [4] 
2 





€ 


In systems in which there are a relatively large number of frequency slots to hop to 
and the jammer’s distance is large compared to the transmitter-receiver distance, the 
probability of a single tone causing an error is quite small. 

Finally, let's consider a CDMA system with M users and M-1 potential inter- 
ferers, as is shown in Fig. 15.2-3(d). Let the probability of one user causing an 
error to another, given that they are using the same frequency slot (thus causing 
a collision c), be 1/2, or P(error|c) — 1/2. Then the probability of an error given 
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Figure 15.2-3 Types of jamming for FH-SS systems. (a) Barrage; [b) partial-band; (c) single- 
tone; (d) multipletone. 


that there is no collision is the same as what occurs when there is only noise, given 
by P(error|c*^) = lg PP. Tf there are Y = 2 frequencies in the CDMA channel, 
then the probability of an intereferer being in a given slot is 1/Y (or the probability of 
not being in that slot is (1 — 1/Y). Therefore, the probability that M — 1 users will not 
interfere is (1 — 1/Y)“~!. Consequently, the probability that M — 1 users could collide 
with the Mth user is P, = 1 — (1 — 1)! = “3 if M «Y. Again using the chain 
rule, we have P, = P(e|c)P(c) + P(e|c?)P(c*) so 


1/M-1 1 M-1 
T + ŻEM ge 5 
P, (( Y ) z? E Y | [5] 
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Let's calculate B; and the error probabilities for a SH-SS system with a message 
rate of 3 kbps, Se = 5.4 X 107? W, given that the PN generator has k = 10 outputs 
and the system has partial band jamming with A = 100%, 10%, and 0.1%. The jam- 
mer power at the receiver is 10 times the signal power and the PSD of the random 
noise is Ny = 107? W/Hz. If k = 10, then Pg = 2? = 1024 = W,/W,. If r, = W, = 
3000 then W, = 3.07 X 10°. With E, = Sy/r,, then E, = 5.4 X 1075/3000 = 1.80 X 
107! and N, = J/W, = 5.4 X 1077/3.07 X 106 = 1.76 X 107, Substituting these 
values into Eq. (4) for A = 1, 0.1, 0.001, we get P, values of 2.37 X 1074, 0.0020, 
and 5.36 X 10~4 respectively. You will note from these that there will be an opti- 
mum value of A that minimizes P,. 


A single-user FH-SS system has P, 


— 1077. For the same number of users as the 
system of Exercise 15.1~2, what is the minimum value of Pg for P, = 107?? 








15.3 CODING 


The PN generator used to generate c(t) and thereby spread the signal is of the same 
type and properties described in Sect. 11.4. The reader should refer back to 
Sect. 11.4 for the discussion of PN generators. 





This objective is met if the PN generator produces an ml sequence. 

To minimize jamming and/or casual eavesdropping, the PN sequence should 
be as long as possible; the longer the sequence, the more effort it takes for an unau- 
thorized listener to determine the PN sequence. However, with linear codes, an 
eavesdropper only requires the knowledge of 2" chips to determine the shift regis- 
ter connections. Therefore vulnerability to unauthorized listening can be reduced 
by either making frequent changes in the PN sequence during transmission or by 
using some nonlinear scheme for the feedback connections. PN codes that are both 
secure and have desirable correlation characteristics are generally difficult to find, 
and thus if the goal is secure communications, then the message should be 
encrypted separately. 

With CDMA systems, in which each user has a unique PN code, the receiver 
should be able to reject other interfering SS signals and/or prevent false correlations. 
As stated by Eqs. (17) and (18), Sect. 15.1, 
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Figure 15.3-1 Auto- and crosscorrelation of [3, 1] and [3, 2] PN sequences. 





Let's consider the crosscorrelation between two different ml sequences. From 
Example 11.4—1, the [3, 1] shift register configuration produces the autocorrelation 
function plotted in Fig. 15.3-1. Note the single periodic correlation peak at 7 = 0, 7 T,. 
It is easily shown that a [3, 2] register configuration produces an ml sequence of 
1110010 and has the same autocorrelation function as the [3,1] register configuration. 
Now if we calculate the crosscorrelation of these two sequences using a shortcut sim- 
ilar to that that we used in Example 11.4—1, and we compare the output from the [3, 1] 
configuration with shifted versions of the [3, 2] configuration, we get the results 
shown in Table 15.3-1 and Fig. 15.3-1. Remember that just as with Example 11.4-1, 
these calculations are based on the PN sequence being a polar NRZ signal. Note that 
there is relatively little difference between the autocorrelation and crosscorrelation 
values, and therefore these particular shift-register configurations would be unsuit- 
able for CDMA systems because we cannot easily discriminate between signals that 
use these PN codes. 

A. PN sequence for CDMA applications should have the following characteris- 
tics: (a) R,,(0),,., relative to R,,(r # 0) should be as large as possible; (b) if s and t 
represent different PN sequences, then |R,,(7)|max should be as small as possible for 
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Table 15.3-1 

7 [3, 1/3, 2] v(7) Ro, uo, (7) = v(/N 

0 1110100 5-223 3/1 2 043 
1110010 

1 1110100 -1 —0.14 
0111001 

2 1110100 3 0.43 
1011100 

3 1110100 -1 —0.14 
0101110 

4 1110100 -1 —0.14 
0010111 

5 1110100 -5 —0.71 
1001011 

6 1110100 3 0.43 
1100101 

7 1110100 3 0.43 
1110010 


all values of 7; and therefore (c) we want to minimize the ratio |R,(7)|max/Rss(O). 
Table 15.3—2 shows the most optimum ratios of |R (T) |, / R,,(0) for various register 
lengths. This table also gives a partial listing of the possible feedback connections 
that enable the generation of ml sequences. 

Unfortunately, as we observe in the plot of Fig. 15.3-1 and the values in Table 
15.3-2, ml sequences generated from a single shift register do not have good cross- 
correlation properties, making them unsuitable for CDMA systems. Also, for a single 
shift register to generate a different output sequence requires changing the feedback 
connections; therefore, a given shift register length will give us relatively few 
unique output sequences. To overcome both of these limitations, Gold codes are 








Table 15.3-2 Crosscorrelation ratios for various ml sequence lengths and feedback connections 
[produced from a single shift register]. Dixon, 1994. 
n N=% -1 JR) haa / 5,00) Feedback Taps 
3 7 0.71 (3, 11, (3, 2] 
5 3] 0.35 [5, 2], [5, 4, 3, 2], (5, 4, 2, 1] 
8 255 " 0.37 [8, 5, 3, 1], [8, 6, 5, 1}, (8, 7, 6, 1] 
9 511 0.22 (9, 4], (9, 6, 4, 3], [9, 8, 6, 5], [9, 8, 7, 6, 5, 3] 
12 4095 0.34 [12, 6, 4, 1], (12, 10, 9, 8, 6, 2], (12, 11, 10, 5, 2, 1] 
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Figure 15.3-2 Gold code generator. 


generated by the mod-2 combination of the output of two or more registers as illus- 
trated in Fig. 15.3—2. For certain combinations of n-bit registers and feedback con- 
nections which we call preferred pairs, Gold (1967, 1968) states that the maximum 
value of crosscorrelation between N = 2" — 1 length sequences is bounded by 


Ra = |o|/N [la] 


where 


(2792 +1 nodd 
= [1b] 


(2*2? 4. 4 neven 
or in some cases 
Ra = —MN [1d 


Table 15.3—3 lists some preferred pairs of registers that will generate Gold codes 
and their respective crosscorrelation upper bound. To see the superior crosscor- 
relation properties of Gold codes versus single register ml sequences, we note from 
Table 15.3-3 that the sequences produced from a 12-bit Gold code will have 
Rsa = 0.031, whereas from Table 15.32 the best of the sequences produced from 
a single 12-shift register will have R,, — = 0.34. 


Table 15.3-3 Some preferred pairs for Gold Code generation and their 
auto/crosscorrelation ratios. Dixon, 1994. 





n Preferred Pair IRy(T) |max/ Res) 
5 (5, 2] [5, 4, 3, 2] 0.290 
7 (7, 3] (7,3, 2, 1] 0.134 
8 (8, 7, 6, 5, 2, 1] [8, 7, 6, 1] 0.129 

10 [10, 8, 5, 1] [10, 7, 6, 4, 2, 1] 0.064 


12 (12, 9, 8, 5, 4, 3] [12, 7, 6, 4] 0.031 
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The autocorrelation function for the Gold code sequence has an off-peak autocorre- 
lation value that is also bounded by Eq. (1) (Proakis, 2001). Therefore, unlike the 
mi sequence produced by a single shift register that has an off-peak autocorrelation value 
bounded by —1/N, the off-peak autocorrelation value from a Gold code sequence could 
be as high as |6|/N. Thus with Gold codes, we obtain a greatly reduced set of crosscor- 
relation values at a cost of a slight increase in the off-peak autocorrelation values. 

For each set of initial conditions in the registers (or register offsets), we get a 
different periodic output sequence. Therefore, two n-bit preferred register pairs will 
give us N = 2" — ] unique output sequences. If we also include the two original 
sequences, our register pair can then produce a total of N + 2 sequences. This is in 
contrast to what was stated earlier, where a given single register configuration can 
only give us one unique periodic output sequence. 

Dixon (1994) has a more extensive list of preferred pairs for Gold code genera- 
tion and also describes an algorithm for the proper selection of preferred pairs used 
to generate Gold codes. 





Ranging using DSS 


À common application of DSS is for distance measurement, and is based on meas- 
uring the phase difference or time delay in chips between transmitter and receiver 
correlation functions. Let's say we have a transmitter that produces a signal v(t) 
with a PN sequence whose autocorrelation is R,,(7) with its maximum value at 7 
= 0. The transmitted signal is reflected off a target at some distance d back to the 
transmitter. Let the reflected signal received back at the transmitter location be 
w(t) = v(t — tj) where t, = 2 d/c, with c being the speed of light. If we correlate 
the received signal with the transmitter PN sequence we get R,,(7) = Ry(T — Ty 
where 7, is KT, chips. The relative phase difference between the two peaks (k chips 
Or 7, = KT, seconds) is the transit time between the transmitted and received signals, 
and is a measure of the distance. Let's say we have a DSS transmitter that sends to a 
target a PN ml sequence of length N — 31 and clock period 10 ns. The sequence is 
received back such that the difference in correlation peak locations 1s 20 chips. The 
distance between the transmitter and target would then be 


= KT,c/2 [2] 
= 20 chips X 10 X 107° seconds/chip X 2.99 x 10? m/s x 1/2 


— 29.00 meters 


hee Nee att alr aN 
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Thus if DSS is used for distance measurement the resolution in meters would be 
Ad = cT, [3] 


An important application of DSS that uses this property of distance measurement 
and triangulation is the global positioning system (GPS). GPS can be used to deter- 
mine the time and a person’s precise latitude, longitude, and altitude on the earth. It 
consists of 24 satellites such that at any one time at least four are visible from any- 
where on earth. Each one is 22,200 km above the earth and circles it every 12 hours. 





EXERCISE 15.3-1 


Show that for any nonzero initial condition, a 5-bit shift register with a [5, 2] con- 
figuration will produce only one unique PN sequence. Generalize your result for any 
single n-bit shift register PN generator. 





15.4 SYNCHRONIZATION 


Up to now, we have assumed that we know the transmitted carrier frequency and 
phase and that there is perfect alignment between the transmitted and receiver PN 
codes. Unfortunately, achieving these objectives is not easy and, in fact, is one of the 
most difficult problems to solve in spread spectrum systems. The initial objective is 
to achieve carrier frequency and phase synchronization. This is made somewhat 
simpler if we use noncoherent detection. We then have to align the transmitter and 
receiver’s PN codes and to maintain synchronization by overcoming frequency drift 
in the transmitted carrier or PN clock. With mobile or satellite SS systems, the car- 
rier frequency and code clock phases can change because of Doppler frequency 
shift! caused by the relative motion between the transmitter and receiver. 

Tracking the incoming carrier frequency and phase is done in the same way as 
with any other digital or analog modulation systems. Specific techniques are discussed 
in Sects. 7.3 and and 14.4. Synchronizing or aligning the receiver PN code to the 
incoming PN code is done in two steps. The first step is called acquisition, where there 
is initial acquisition and a coarse alignment to within a half a chip between the two 
codes. The second step is the ongoing and fine alignment called tracking. Both acqui- 
sition and tracking involve a feedback loop, with tracking done using PLL techniques. 


Acquisition 
Figure 15.4—1 is a block diagram showing a serial search acquisition system for a 


DSS system, and Fig. 15.4—2 shows one for a FH-SS system. Both work in a similar 
manner. The transmitted PN code is contained in y(t), and the receiver generates a 


‘Doppler frequency shift is a change in the perceived received frequency caused by the relative motion 
between transmitter and receiver. Doppler frequency shift equals Af = *ufy/c where v is the relative 
velocity between transmitter and receiver, fy is the nominal frequency, and c is the speed of light. 
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Figure 15.4-1 DS serial search acquisition. 
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Figure 15.4-2 Frequency hopping serial search acquisition. 


replica code, but likely is offset by some phase difference of N, which is an integer 
number of chips. N, is also a random variable whose maximum value is the PN 
period in chips minus one. The goal is that these two PN codes be aligned to within 
a half chip. Then the tracking phase takes over. For initial acquisition, some systems 
use an abbreviated version of the PN code called a preamble. 

Acquisition works as follows. The received signal y(t), which contains | PN 
chips, is multipled by its replica that contains the same number of chips. The prod- 
uct is then integrated from 0 to T, seconds where T, is the PN clock period. If there 
is alignment, the signal will have been de-spread and the integrator’s output will be 
relatively high compared to the threshold V, and therefore the PN code phase is not 
changed. If there isn’t alignment, the threshold circuitry causes the PN code genera- 
tor to increment its phase by half a chip, and the entire process is repeated until 
alignment is achieved. Assuming the system properly recognizes the correct PN 
sequence, the maximum time for acquisition would be 


TSAN. [1] 





15.4. Synchronization 


where the factor 2 occurs because we are correlating in increments of half chips. In 
systems that contain noise, there is a possibility of an incorrect synchronization. 
Equation (1) is modified to obtain the average acquisition time of 

DK A 2 mE Pp 


acq — Pp 





(1 + wPrs)N,IT, [2] 


where Pp is the probability of a correct detection, Ppa is the probability of a false 
synchronization, and o is a factor that reflects a penalty due to a false alarm with 
a >> 1. The variance in acquisition time is 


o = (NIT? + aP E + = 2 +) (3 
Eqs. (2) and (3) are derived by Simon et al. (1994). 

To decrease the acquisition time for DS-SS, we could modify the system of 
Fig. 15.4 —1 to become a sliding-correlator where instead of waiting for the inte- 
gration of the entire / T, sequence, we do a continual integration and feed the result 
to the comparator such that when the output is above threshold, we stop the process 
because alignment has been achieved. Alternatively, we can speed up the process by 


a factor of 2N, by going to a parallel structure and adding 2(N, — 1) correlators to 
the structure of Fig. 15.4-1. 


Tracking 

Once we have achieved course synchronization, tracking or fine synchronization 
begins. For a DSS signal, we can use a delay-locked loop (DLL) as shown in 
Fig. 15.4 3. The received signal y(1) is fed to two multipliers. Each version of y(t) is 
multiplied by a retarded or advanced version of the PN code generator output, that is 
c(t + 8) where ô is a small fraction of the clock period T,. The output of each multi- 
plier is then bandpass filtered and envelope detected. The two outputs are summed 
and fed to a loop filter that controls the VCO (voltage controlled oscillator). If there 
is synchronization error, one of the envelope detectors will have a larger value than 
the other, causing a jump in the summer output, which in turn will cause the VCO to 
advance or retard its output and thereby causes c(f) to straddle between the values of 
c(t + ô) and c(t — ô). 

A simpler alternative to the DLL is the Tau-Dither loop of Fig. 15.4 -4. Here 
only one loop is required, so we don't have to worry about two loops having identi- 
cal gains. It works in a similar manner to the DLL except that one loop is shared by 
the advance and retard signal. When signal q(t) is +1, the input signal y(¢) is multi- 
plied by c(t + ô). This product is then detected, multipled by q = 1 and fed to the 
loop filter. At the next cycle g(f) = — 1, the input signal y(f) is multiplied by c(t — ô). 
It too is enveloped detected, scaled by — 1 and fed to the loop filter. If both signals 
are the same, the VCO output remains unchanged. But, like the DLL, if one signal is 


larger than the other, then the VCO output changes causing the PN code generator to 
change accordingly. 


691 


692 


CHAPTER 15 * Spread Spectrum Systems 







Envelope 
detector 






c(t + à) 


Delay by 


+6 
c(t) 


Delay by 
-6 
c(t — 8) 









y(t) 





+ 
Loop 
filter © 

















Envelope 
detector 


Figure 15.4-3 Delay-locked loop (DL). 
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Figure 15.4-4 Tau-Dither synchronization. 





15.5 WIRELESS TELEPHONE SYSTEMS 


In the last 15 years, wireless telephone systems have undergone a tremendous trans- 
formation from being a luxury to almost a necessity. In fact, formerly theater patrons 
were requested not to smoke; now they are requested to turn off their cell phones! In 
some cases wireless systems have even supplanted land-line phone systems. For 
example, some nations are considering not upgrading their copper-wire telephone 
system but instead replacing them with wireless telephones. Wireless telephones have 
enabled everybody to be reachable by telephone at any time and location. In this 
section we want to consider the two primary wireless systems: cellular telephones 
and personal communications systems or services (PCS). 











15.5 Wireless Telephone Systems 


Cellular Telephone Systems 
Cellular telephones, by definition, are FDMA FM systems that operate in the 


800-MHz band with W = 3 kHz and B; = 30 kHz. They work in the following man- ` 


ner. À given service area is divided up into cells; each cell has a base station we call 
a mobile telephone switching office (MTSO), consisting of a transmitter, receiver, 
and tower for the antenna. The MTSO in turn is connected to a telephone central 
office via telephone lines and thereby interfaces the cell-phone user to the rest of the 
telephone system. When a call is placed, the MTSO authenticates the cell-phone 
user based on the cell phone serial number and assigned phone number and then 
assigns a set of available transmit and receive frequencies to the cell phone. During 
the call, the MTSO monitors the cell phone's signal strength so that if the user 
moves, the MTSO switches the call to a different cell station and thereby maintains 
signal strength. The process is seamless. Increasing the density of cell towers/stations 
in a given area, while not necessarily aesthetically pleasing, enables the cell phones 
and MTSO transmitters to be low power. In turn, this means their signals will not 
propagate much beyond a given cell area, thus enabling much greater reuse of avail- 
able frequencies in other cell areas without interference. Low power also means cell 
phones with smaller physical size and reduced battery requirements. 


Personal Communications Systems 


Recent government telecommunication deregulation has made available the 900 MHz 
and 1.8 GHz (1.9 GHz in Europe) frequencies for portable digital phones using either 
TDMA or DSS-CDMA digital technology. To differentiate themselves from tradi- 
tional analog cellular phone systems, the service providers have called this new sys- 
tem personal communications systems (PCS) or wireless phones—wireless being 
synonymous with the new digital technology. Not only are there more frequencies, but 
digital technology has expanded the number of available services. These include Inter- 
net for e-mail and World-Wide-Web access, navigation aides, messaging, biomedical 
telesensors, and so on. As we stated earlier, the inherent nature of CDMA is that 
cross talk is virtually nonexistent if the spread codes are mutually orthogonal and we 
eliminate, or at least reduce, the need for frequency allocation. More users only raise 
the noise floor, and thus there is no hard limit on the number in a given area. Unlike 
cell phone technology, we don't have to be as strict with respect to maintaining mini- 
mum physical distances between systems that use the same frequencies. Furthermore, 
the FCC has also allowed DSS-CDMA communication to overlay the existing 
800 MHz cell-phone band. These additional users do not pose a serious interference 
problem to existing cell-phone users since a DSS signal received at any unauthorized 
receiver sounds like random noise regardless of the type of detector used. 

More than other terrestrial radio systems, wireless phone systems suffer from 
the near-far problem. This is where you have unequal signal powers arriving at the 
receiver, and the stronger one drowns out the weaker one. Overcoming this problem 
requires receiver amplifiers that have a wide dynamic range, so strong signals will 
not overload the front-end amplifiers and other adaptive gain control techniques. For 
more information on wireless systems see Rappaport (1996). 
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15.6 PROBLEMS 


A DSS-BPSK system has an information rate of 3 kbps, and is operating in an envi- 
ronment where, at the receiver, there is a single-tone interferring signal five times 
more powerful than the received signal and Ny = 107?! W/Hz. Without the jammer, 
the (S/N), = 60 dB. If the required P, = 1077, calculate (a) the chip rate, (b) By. 


A DSS-BPSK system has Pg = 30 dB, and in the presence of a jammer the P, = 
1077. What is the jamming margin? 


A DSS-CDMA system is used for telephone transmitting information at a rate of ` 
6 kbps. The system uses a code rate of 10 MHz and (S/N), = 20 dB with just one 
signal transmitted. The required P, = 107’, and each signal is received with equal 
power. (a) How many users can share the channel? (b) If each user reduces power by 
6 dB, how many can share the channel? 


What is the minimum chip rate required to overcome multipath interference if the 
multipath length signal is 500 meters longer than the direct signal? 


Ten DSS-BPSK users, with each one transmitting information at a rate of 6 kbps, 
want to share a channel via CDMA. Assuming additive noise can be ignored, what 
is the minimum chip rate in order to achieve P, — 1077? 


Repeat Prob. 15.1—5 assuming the channel is corrupted by white gaussian noise such 
that with a single user, P, = 107°. 


A DSS system transmits information at a rate of 9 kbps in the presence of a single- 
tone jammer whose power is 30 dB greater than the desired signal. What is the pro- 
cessing gain required in order to achieve P, « 1077? 


How many customers of equal power can share a CDMA system if each one is using 
DS-BPSK where the processing gain is 30 dB and with P, « 1077? 


A DSS-BPSK user transmits information at a rate of 6 kbps, using a chip rate of 
10 X 106 and the received signal has a P, = 10719. How many users can share the 


channel if the P, requirement was reduced to 10^? and each user reduces their power 
by 3 dB? 


A binary FSK SH-SS system has an information rate of 3 kbps, and is operating in a 
barrage jamming environment where the entire channel is being jammed with a 
power level five times greater than the received signal. Without the jammer, the 
(S/N), = 60 dB and Nọ = 107?! W/Hz. If the required P, = 1077, determine the 
minimum Pg and corresponding Br. 


Determine the minimum Pg and corresponding By for a binary FSK system that has 
an information rate of 3 kbps, 10 users, and P, = 1075. It has been found for one 
user that for one user that P, = 107 !9, 


Given the system in Prob. 15.2-1, what would be Pg and By for a DSS that enables 
the same P,? 
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What is P, for a SH-SS system in the presence of partial band jamming with y = 
10%, J = 6 mW, E, = 2 X 10, Ny = 1077, r, = 6 kbps, and the PN generator has 
k = 10 outputs? 


Calculate the minimum hop rate for a FH-SS system that will prevent a jammer 
from operating five miles away from the receiver. 


Given a four-stage shift register with [4, 1] configuration, where the initial state is 
0100, and 1s driven by a 10 MHz clock, what is (a) its output sequence, and (b) PN 
sequence period? Plot the autocorrelation function produced by this shift register. 


Repeat Prob. 15.3—1 for a four-stage shift register with [4, 2] configuration. 


Show that the output sequence for the [4, 1] shift register has the properties of a ml 
sequence as described in Sect. 11.4. 


What is the output sequence and its corresponding length from a 5-bit shift register 
PN generator with (5, 4] feedback connections with the initial state 11111? Is it a ml 
sequence? 


Given a Gold code generator consisting of preferred pair [5, 2] and [5, 4, 3, 2], 
where both registers initially have all 1s, what is its (a) output sequence, (5) cross- 
correlation bound? 


What chip rate is required for a DSS distance measuring system to achieve a resolu- 
tion of 0.01 miles? 


What is the distance resolution for a DSS system with a 20-MHz code clock rate? 


A 10-MHz code generator drifts 5 Hz/hour. What is the chip uncertainity after one 
day? 


A DSS system operating at a carrier frequency of 900 MHz with a 10-MHz code 


clock is traveling toward us at 500 mph. What is the maximum shift in carrier fre- 
quency and chip uncertainty due to Doppler? 


What is the average acquisition time and its standard deviation for a DSS-BPSK, 
with chip rate of 10 MHz, that uses a serial search of a 2048-bit preamble and has a 
Pp = 0.9, Pra = 0.01, and a = 100? Assume the maximum initial phase difference 
between the input signal’s PN sequence and receiver’s PN generator. 


A DSS-BPSK system uses a code clock of 50 MHz and a 12-stage shift register set 
up for maximal length. Assuming the phase difference between the received signal’s 
PN sequence and the receiver PN generator is half a PN period, calculate the aver- 
age acquisition time and its standard deviation if a serial search is done with Pp = 
0.9, Pm = 0.01, and a = 10. 
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CHAPTER 16 e Information and Detection Theory 


hroughout this book we have studied electrical! communication primarily in terms of 

signals—desired information-bearing signals corrupted by noise and interference sig- 
nals. Although signal theory has proved to be a valuable tool, it does not come to grips 
with the fundamental communication process of information transfer. Recognizing the need 
for a broader viewpoint, Claude Shannon drew upon the earlier work of Nyquist and 
Hartley and the concurrent investigations of Wiener to develop and set forth in 1948 a 
radically new approach he called "A Mathematical Theory of Communication." 

Shannon's paper isolated the central task of communication engineering in this ques- 
tion: Given a message-producing source, not of our choosing, how should the messages 
be represented for reliable transmission over a communicotion channel with its inherent 
physical limitations? To address that question, Shannon concentrated on the message 
information per se rather than on the signals. His approach was soon renamed informa- 
tion theory, and it has subsequently evolved into a hybrid mathematical and engineering 
discipline. 

Information theory deals with three basic concepts: the measure of source informo- 
tion, the information capacity of a channel, and coding as a means of utilizing channel 
capacity for information transfer. The term coding is taken here in ihe broadest sense of 
message representation, including both discrete and continuous waveforms. The three 
concepts of information theory are tied together through a series of theorems that boil 
down to the following: 





The surprising, almost incredible aspect of this statement is its promise of error-free trans- 
mission on a noisy channel, a condition achieved with the help of coding. 

Optimum coding matches the source and channel for maximum reliable information 
transfer, roughly analogous to impedance matching for maximum power transfer. The cod- 
ing process generally involves two distinct encoding/decoding operations, portrayed dia- 
grammatically by Fig. 16.0. The channel encoder/decoder units perform the task of error- 
control coding as discussed in Sects. 13.1-13.3 where we saw how the effects of 
channel noise could be reduced. Information theory goes a step further, asserting that opti- 
mum channel coding yields an equivalent noiseless channel with a well-defined capacity 
for information transmission. The source encoder/decoder units then match the source to 
the equivalent noiseless channel, provided that the source information rate falls within the 
channel capacity. 

This chapter starts with the case of digital or discrete information, including information 
measure, source coding, information transmission, and discrete channel capacity. Many 
concepts and conclusions gained from the study of discrete information carry over to the 
more realistic case of information transmission on a continuous channel, where messages 
take the form of time-varying signals. The Hartley-Shannon law defines continuous channel 
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Figure 16.0 Communication system with source and channel coding. 


capacity in terms of bandwidth and signaHo-noise ratio, and thereby serves as a benchmark 
for comparing the performance of communication systems in the light of information theory. 

The chapter closes with an application of theory to optimum digital detection. We'll 
first introduce signal space as a means of representing and quantilying signals. Then the 
principles of optimum detection will be applied to digital communication. 


OBJECTIVES 


After studying this chapter and working the exercises, you should be able to do each of the 
following: 


1. Define Shannon’s measure of information (Sect. 16.1). 


2. Calculate the information rate of a discrete memoryless source (Sect. 16.1). 

3. Analyze a discrete memoryless channel, given the source and transition probabilities 
(Sect. 16.2). 

4. State and apply Shannon's fundamental theorem for information transmission on a 


noisy channel (Sects. 16.2 and 16.3). 


5. State and apply the Hartley-Shannon law for a continuous channel (Sect. 16.3). 
Draw the vector representation of two or more signals (Sect. 16.4). 


7. Construct the decision regions for MAP detection in a two-dimensional space 
(Sect. 16.5). 
8. Find the decision functions for a MAP receiver, and evaluate the symbol error proba- 


bilities when there are rectangular decision regions (Sect. 16.5). 





16.1 INFORMATION MEASURE 
AND SOURCE ENCODING 


Quite logically, we begin our study of information theory with the measure of infor- 
mation. Then we apply information measure to determine the information rate of 
discrete sources. Particular attention will be given to binary coding for discrete 


memoryless sources, followed by a brief look at predictive coding for sources with 
memory. 
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Information Measure 


The crux of information theory is the measure of information. Here we use informa- 
tion as a technical term, not to be confused with "knowledge" or “meaning”— 
concepts that defy precise definition and quantitative measurement. In the context of 
communication, information is simply the commodity produced by the source for 
transfer to some user at the destination. This implies that the information was previ- 
ously unavailable at the destination; otherwise, the transfer would be zero. 

Suppose, for instance, that you're planning a trip to a distant city. To determine 
what clothes to pack, you might hear one of the following forecasts: 


* The sun will rise. 
* There will be scattered rainstorms. 


* There will be a tornado. 


The first message conveys virtually no information since you are quite sure 
in advance that the sun will rise. The forecast of rain, however, provides informa- 
tion not previously available to you. The third forecast gives you more infor- 
mation, tornadoes being rare and unexpected events; you might even decide to 
cancel the trip! 

Notice that the messages have been listed in order of decreasing likelihood and 
increasing information. The less likely the message, the more information it con- 
veys. We thus conclude that information measure must be related to uncertainty, the 
uncertainty of the user as to what the message will be. Alternatively, we can say that 
information measures the freedom of choice exercised by the source in selecting a 
message. When a source freely chooses from many different messages, the user is 
highly uncertain as to which message will be selected. 

Whether you prefer the source or user viewpoint, it should be evident that 
information measure involves the probability of a message. If x; denotes an arbi- 
trary message and P(x;) = P, is the probability of the event that x; is selected for 
transmission, then the amount of information associated with x; should be some 
function of P;. Specifically, Shannon defined information measure by the logarith- 
mic function 
2 —log, P; = pas ni 

P, 
where b is the logarithmic base. The quantity 7; is called the self-information of 
message x;. The value of J; depends only on Pj, irrespective of the actual message 
content or possible interpretations. For suppose you received a weather forecast 
that said 


I; 


* The sun will rain tornadoes 


It will convey lots of information— being very improbable—despite the lack of 
substance. 
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Although not immediately obvious, the definition in Eq. (1) has several impor- 
tant and meaningful consequences, including 


,20 for OzP,x1 [2a] 
L0 for P;—1 [25] 


Hence, J; is a nonnegative quantity with the properties that J, = 0 if P; = 1 (no 
uncertainty) and J; > J; if P; < P; (information increases with uncertainty). Fur- 


thermore, suppose a souxce produces two successive and independent messages, x; 
and x;, with joint probability P(x;x;) = P,P; then 


Ij = log, E = log, 5 + log, E — ll [3] 
so the total information equals the sum of the individual message contributions. 
Shannon's information measure log, (1/P;) is the only function that satisfies all of 
the properties in Eqs. (2) and (3). 

Specifying the logarithmic base b determines the unit of information. The stan- 
dard convention of information theory takes b — 2 and the corresponding unit is the 
bit, a name coined by J. W. Tukey as the contraction for "binary digit" Equation (1) 
thus becomes J; = log; (1/P;) bits. This convention normalizes information meas- 
ure relative to the most elementary source, a source that selects from just two 
equiprobable messages. For if P(x,) = P(x) = 1/2, then J, = I, = log, 2 = 1 bit. 
In other words, 1 bit is the amount of information needed to choose between two 
equally likely alternatives. 

Binary digits enter the picture simply because any two things can be repre- 
sented by the two binary digits. However, you must carefully distinguish informa- 
tion bits from binary digits per se—especially since a binary digit may convey more 
or less than one bit of information, depending upon its probability. To prevent possi- 
ble misinterpretation, the abbreviation binits is sometimes used for binary digits as 
message or code elements. 


When necessary, you can convert base-2 to natural or common logarithms via 
Inv  logiv 


log v = = = 4 
Par ena M 





If P, = 1/10, for instance, then J; = (logy) 10)/0.301 = 3.32 bits. Hereafter, all 
logarithms will be base-2 unless otherwise indicated. 


Entropy and Information Rate 


Now consider an information source that emits a sequence of symbols selected from 
an alphabet of M different symbols, i.e., an M-ary alphabet. Let X denote the entire 
set of symbols x,, x», ...; xy. We can treat each symbol x; as a message that occurs 
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with probability P; and conveys the self-information J;. The set of symbol probabili- 
ties, of course, must satisfy 


M 
SP, =1 [5] 


We'll assume that the source is stationary, so the probabilities remain constant over 
time. We'll also assume that successive symbols are statistically independent and 
come from the source at an average rate of r symbols per second. These properties 
define the model of a discrete memoryless source. 

The amount of information produced by the source during an arbitrary symbol 
interval is a discrete random variable having the possible values J, b, ..., Iy. The 
expected information per symbol is then given by the statistical average ` 


M M 
H(X) 2 X Bl = XP; log = bits/symbol [6] 
i=l i=l i 

which is called the source entropy. Shannon borrowed the name and notation H 
from a similar expression in statistical mechanics. Subsequently, various physical 
and philosophical arguments have been put forth relating thermodynamic entropy to 
communication entropy (or comentropy). 

But we'll interpret Eq. (6) from the more pragmatic observation that when the 
source emits a sequence of n >> 1 symbols, the total information to be transferred 
is about nH(X) bits. Since the source produces r symbols per second on average, the 
time duration of this sequence is about n/r. The information must therefore be trans- 
ferred at the average rate nH(X)/(n/r) = rH(X) bits per second. Formally, we 
define the source information rate 


R Ê rH(X) bits/sec [7] 


a critical quantity relative to transmission. Shannon asserted that information from 
any discrete memoryless source can be encoded as binary digits and transmitted 
over a noiseless channel at the signaling rate r, = R binits/sec. As preparation for 
the study of source coding, we need to know more about entropy. 

The value of H(X) for a given source depends upon the symbol probabilities P; 
and the alphabet size M. Nonetheless, the source entropy always falls within the limits 


0 = H(X) = log M [8] 


The lower bound here corresponds to no uncertainty or freedom of choice, which 
occurs when one symbol has probability P; = 1 while P; = 0 for i # j—so the 
source almost always emits the same symbol. The upper bound corresponds to max- 
imum uncertainty or freedom of choice, which occurs when P; = 1/M for all i—so 
the symbols are equally likely. 

To illustrate the variation of H(X) between these extremes, take the special but 
important case of a binary source (M — 2) with 


Ep acce» 
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Q) 


0.5 T 





0 0.5 1.0 


Figure 16.1-1 Binary entropy function. 


Substituting these probabilities into Eq. (6) yields tbe binary entropy 





[9] 


1 
H(X) = Op) Š p log + (1 =p) log T= 


in which we've introduced the “horseshoe” function Q(p). The plot of O(p) in 
Fig. 16.1—] displays a rather broad maximum centered at p = 1 — p = 1/2 where 
H(X) = log 2 = 1 bit/symbol; H(X) then decreases monotonically to zero as p > 1 
orl—-pl. 

Proving the lower bound in Eq. (8) with arbitrary M is easily done once you 
note that v log (1/v) — 0 as v — 0. The proof of the upper bound H(X) = log M 
involves a few more steps but deserves the effort. 

First, we introduce another set of probabilities Qi, Q5, ..., Qu, and replace 
log (1/P;) in Eq. (6) with log (Q,/P;). Conversion from base-2 to natural logarithms 
gives the quantity 


2; P log £ = n2 In E 
where it's understood that all sums range from i — 1 to M. Second, we invoke the 


inequality In v = v — 1, which becomes an equality only if v = 1, as seen in 
Fig. 16.1-2. Thus, letting v = Pj/ Q; and using Eq. (5), 


pont ze(2-:) -(z2)- 


Third, we impose the condition 


M 
5 Q: = [10a] 
i=] 
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Figure 16.1-2 


so it follows that 
M Q; 
X Pilg% s1-1=0 [10b] 
- P. 
Finally, taking Q; = 1/M we have 
1 1 
Zh log 5 ay = XP log 5 — X log M = H(X) — log M x 0 


thereby confirming that H(X) = log M. The equality holds only in the equally likely 
case P; = 1/M sov = Q,/P; = 1/Mp, = 1 for alli. 





EXAMPLE 16.1-1 Suppose a source emits r — 2000 symbols/sec selected from an alphabet of size 
M = 4 with symbol probabilities and self information listed in Table 16.1-1. Equa- 
tion (6) gives the source entropy 
H(X)-ix1-—-ix2-4ix3-ix3-21375bis/symbol 
which falls somewhat below the maximum value log M = 2. The information 
rate is 
R = 2000 X 1.75 = 3500 bits/sec 


and appropriate coding should make it possible to transmit the source information at 
the binary signaling rate r, = 3500 binits/sec. 
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Table 16.1-1 

P, 
1/2 
1/4 
1/8 
1/8 


x 


M 


Y Aaw 
V WN 





Suppose a source has M = 3 symbols with probabilities P; = p and Pj = Pj. Show 





EXERCISE 16.1-1 


that H(X) = O(p) + 1 — p. Then evaluate H(X) | max and the corresponding value 
of p. 





Coding for a Discrete Memoryless Channel 


When a discrete memoryless source produces M equally likely symbols, so R = 
r log M, all symbols convey the same amount of information and efficient transmis- 
sion can take the form of M-ary signaling with a signaling rate equal to the symbol 
rate r. But when the symbols have different probabilities, so R = rH(X) < rlog M, 


efficient transmission requires an encoding process that takes account of the vari- 
able amount of info Imation nar oumbnl Lara o wa!'ll inveecticata anurra ancading 


per symbol Mere vue P oinvesltsuta sources Snecdina 
with a binary encoder. Equivalent results for nonbinary encoding just require chang- 
ing the logarithmic base. 

The binary encoder in Fig. 16.1-3 converts incoming source symbols to code- 
words consisting of binary digits produced at some fixed rate r,. Viewed from its 
output, the encoder looks like a binary source with entropy Q(p) and information 
rate r,Q(p) = r,log 2 = ry. Coding obviously does not generate additional infor- 
mation, nor does it destroy information providing that the code is uniquely deci- 
pherable. Thus, equating the encoder's input and output information rates, we con- 
clude that R = rH(X) = r,Q(p) = r, or r,/r = H(X). 

The quantity N = r,/r is an important parameter called the average code 
length. Physically, N corresponds to the average number of binary digits per source 
symbol. Mathematically, we write the statistical average 
















Discrete R = rH(X) Binar 
memoryless "e 
Source TEQUE 








Figure 16.1-3 
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Table 16.1—2 lists some potential codes for the source in Example 16.1—1. Code I is 
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= M 
N= MEN, [11] 


where N; represents the length of the codeword for the ith symbol. 


Shannon's source coding theorem states that the minimum value of N is 
bounded by 


H(X) EN < H(X) +e [12] 


with e being a positive quantity. The value of e can be made arbitrarily small, in the- 
ory, and optimum source encoding achieves the lower bound N — H(X). In practice, 
we often settle for suboptimum coding with N > H(X) if the code has reasonably 
good efficiency. The ratio R/r, = H(X)/N = 1 serves as the measure of efficiency 
for suboptimum codes. 

The source coding theorem presumes a uniquely decipherable code to ensure 
that no information is lost. This requirement imposes an additional but indirect con- 
straint on N. Specifically, as a necessary and sufficient condition for a uniquely deci- 
pherable binary code, the word lengths N; must be such that 


ig 
K= 32% <1 [13] 
i=} 
which is the Kraft inequality. The simplest encoding process generates a fixed- 
length code, all codewords having the same length N; = N. The Kraft inequality in | 
Eq. (13) then becomes K = M2" x 1, so decipherability requires N = log M 
and the resulting efficiency is H(X)/N = H(X)/log M. When H(X) < log M, 
higher efficiency calls for variable-length coding to reduce the average code length 
N. An example should help clarify these concepts. 


a fixed-length code with N = log M = 2 binits/symbol, as compared to H(X) = 
1.75 bits/symbol. The efficiency is H(X)/N = 1.75/2 = 88%—not bad, but we 
can do better with a variable-length code. 

Application of Eqs. (11) and (13) to Code II gives 


N-ixicixicix2-4ix22125« H(X) 
K=21'4+271427427=15>1 


The result N < H(X) is meaningless because K > 1, which tells us that this code is 
not uniquely decipherable. For instance, the code sequence 10011 could be decoded 
as BAABB or CABB or CAD, and so on. Hence, Code II effectively destroys source 
information and cannot be used. 

Code III, known as a comma code, has K < 1 and ensures decipherability by 
marking the start of each word with the binary digit 0. However, the extra comma 
digits result in N = 1.875 > H(X). 


16.1 Information Measure and Source Encoding 


Code IV is a tree code with the property that no codeword appears as the 
prefix in another codeword. Thus, for example, you can check that the code 
sequence 110010111 unambiguously represents the message CABD. This code is 
optimum for the source in question since it has N — 1.75 — H(X) as well as 
K 7 1. 


Table 16.1-2 Illustrative source codes 
x; Pi CodeI Code II Code III Code IV 
A 1/2 00 0 0 0 
B 1/4 01 1 01 10 
C 1/8 . 10 10 011 110 
D 1/8 l1 11 0111 111 
N 2.0 1.25 1.875 1.75 
K 1.0 1.5 0.9375 1.0 


Having demonstrated optimum coding by example, we turn to a general proof 
of the source coding theorem. For this purpose we start with Eq. (105), taking 
Q; = 2 "'/K where K is the same as in Eq. (13), which satisfies Eq. (10a). Then 


i 1 zh 
bw log £ = SP(toe t — N; — log K) = H(X) - N — log K S50 
or, since log K = 0, we have 


N z H(X) 


which establishes the lower bound in Eq. (12). The equality holds when K = 1 and 
P; = Q;. Optimum source encoding with N = H(X) therefore requires K = 1 and 
symbol probabilities of the form 


py 22m i= 1,2,...,M [14] 


so that N; = —log P; = Jj. 

An optimum code must also have equally likely 1s and Os in order to maximize 
the binary entropy at O(p) = 1. Code IV in Table 16.1—2 exemplifies these optimal 
properties. 

Granted, we can’t expect every source to obey Eq. (14), and we certainly can’t 
control the statistics of an information source. Even so, Eq. (14) contains a signifi- 
cant implication for practical source coding, to wit: 





707 


708 


CHAPTER 16 * Information and Detection Theory 


Long before Shannon, Samuel Morse applied this commonsense principle to his 
telegraph code for English letters. (Incidentally, Morse estimated letter probabilities 
by counting the distribution of type in a printer's font.) We'll invoke this principle to 
establish the upper bound in the source coding theorem. 


Let the length of the ith codeword be an integer N; that falls within 
LSN «lctlor 


1 1 
bep c4 NE i [15] 
You can easily confirm that this relationship satisfies the Kraft inequality. Multiply- 
ing Eq. (15) by P; and summing over i then yields 


HX) SN < H(X) +1 [16] 


Hence; a code constructed in agreement with Eq. (15) will be reasonably efficient if 
either H(X) >> 1 or N, ~ log (1/P;) for all i. If neither condition holds, then we 
must resort to the process called extension coding. 

For an extension code, n successive source symbols are grouped into blocks and 
the encoder operates on the blocks rather than on individual symbols. Since each 
block consists of n statistically independent symbols, the block entropy is just 
nH(X). Thus, when the coding rule in Eq. (15) is modified and applied to the exten- 
sion code, Eq. (16) becomes nH(X) <= nN < nH(X) + 1, where nN is the average 
number of binary digits per block. Upon dividing by n we get 


H(X) = Ñ < H(X) + - [17] 


which is our final result. 

Equation (17) restates the source coding theorem with e — 1/n. It also 
shows that N — H(X) as n — oo, regardless of the source statistics. We've 
thereby proved that an nth-extension code comes arbitrarily close to optimum 
source coding. What we haven't addressed is the technique of actual code con- 
struction. Systematic algorithms for efficient and practical source coding are pre- 


sented in various texts. The following example serves as an illustration of one 
technique. 





EXAMPLE 16.1-3 


Shannan Pano Salina 


Shannon-Fano coding generates an efficient code in which the word lengths 
increase as the symbol probabilities decrease, but not necessarily in strict accor- 
dance with Eq. (15). The algorithm provides a tree-code structure to ensure unique 
decipherability. We'll apply this algorithm to the source with M=8 and 
H(X) = 2.15 whose statistics are listed in Table 16.1—3. 
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Table 16.1-3 Shannon-Fano coding 













Coding steps 
Xx; P; Codeword 
A 0 
B 0.15 100 
C 0.15 101 
D 0.08 110 
E 0.08 0 1110 
F 0.02 1 0 11110 
G 0.01 1 1 (ee oa 111110 
H 0.01 1 1 1 111111 























The Shannon-Fano algorithm involves a succession of divide-and-conquer 
steps. For the first step, you draw a line that divides the symbols into two groups 
such that the group probabilities are as nearly equal as possible; then you assign the 
digit 0 to each symbol in the group above the line and the digit 1 to each symbol in 
the group below the line. For all subsequent steps, you subdivide each group into 
subgroups and again assign digits by the previous rule. Whenever a group contains 
just one symbol, as happens in the first and third steps in the table, no further subdi- 
vision is possible and the codeword for that symbol 1s complete. When all groups 
have been reduced to one symbol, the codewords are given by the assigned digits 
reading from left to right. A careful examination of Table 16.1—3 should clarify this 
algorithm. 

The resulting Shannon-Fano code in this case has N — 2.18 so the efficiency is 
2.15/2.18 = 99%. Thus, if the symbol rate is r = 1000, then r, = Nr = 
2180 binits/sec—slightly greater than R = rH(X) = 2150 bits/sec. As comparison, 
a fixed-length code would require N = log 8 = 3 and r, = 3000 binits/sec. 





EXERCISE 16.1-2 


Apply the Shannon-Fano algorithm to the source in Table 16.1-2 (p. 707). Your 
result should be identical to Code IV. Then confirm that this code has the optimum 
property that N; = J; and that Os and 1s are equally likely. 





Predictive Coding for Sources with Memory 


Up to this point we've assumed a memoryless source whose successive symbols are 
statistically independent. But many information sources have memory in the sense that 
the symbol probabilities depend on one or more previous symbols. Written language, 
being governed by rules of spelling and grammar, provides a good illustration of a 
source with memory. For instance, the letter U (capitalized or not) occurs in English 
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text with probability P(U) — 0.02 on the basis of relative frequency; but if the previ- 
ous letter is Q, then the conditional probability becomes P(U | Q) 7 1. Clearly, mem- 
ory effect reduces uncertainty and thereby results in a lower value of entropy than 
would be calculated using absolute rather than conditional source statistics. 

Suppose that a source has a first-order memory, so it “remembers” just one 
previous symbol. To formulate the entropy, let P,; be the conditional probability that 
symbol x; is chosen after symbol x;. Substituting P; for P; in Eq. (6), we have the 
conditional entropy 


^ 1 
H(X|x) = > Plog — 
i » : Pj [18a] 
which represents the average information per symbol given that the previous symbol 
was xj. Averaging over all possible previous symbols then yields 


H(X) = DB A(X | x) [18b] 


An equivalent expression applies to the general case of a qth-order memory. How- 
ever, the notation gets cumbersome because x; must be replaced by the state of the 
source defined in terms of the previous q symbols, and there are M? possible states 
to consider. 

A source with memory is said to be redundant when the conditional probabil- 
ities significantly reduce H(X) compared to the upper bound log M. The redun- 
dancy of English text has been estimated at about 50 percent, meaning that roughly 
half the symbols in a long passage are not essential to convey the information. By 
way of example, yu shd babl t read ths evntho sevrl ltrs r msng. It likewise follows 
that if uncertainty is reduced by memory effect, then predictability is increased. 
Coding for efficient transmission can then be based on some prediction method. 
Here we'll analyze the scheme known as predictive run encoding for a discrete 
source with memory. 

Consider the encoding system in Fig. 16.1—4a, where source symbols are first 
converted to a binary sequence with digit rate r. Let x(i) denote the ith digit in the 
binary sequence, and let x(i) be the corresponding digit generated by a predictor. 
Mod-2 addition of x(i) and x(i) yields the binary error sequence «e(i) such that 
e(i) = O when X(i) = x(i) and e(i) = 1 when x(i) »* x(i). The error sequence is fed 
back to the predictor in order to update the prediction process. The decoding system 
in Fig. 16.1-4b employs an identical predictor to reconstruct the source symbols 
from the mod-2 sum x(i) = x(i) ® e(i). We'll assume the source is sufficiently pre- 
dictable that a predictor can be devised whose probability of a correct prediction is 


p = Ple(i) = 0] > 1/2 


We'll also assume that prediction errors are statistically independent. Hence the 
error sequence entropy is O(p) < 1 and appropriate encoding should allow infor- 
mation transmission at the rater, < r. 
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Run-length us 
decoder : to M-ary = ——- 
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Figure 16.1-4 Predictive coding system for a source with memory. (a] Encoder; 


(b) decoder. 


Predictive run encoding exploits the fact that the error sequence consists of 
more Os than 1s—the better the prediction, the higher the percentage of Os. We 
define a run of n to be a string of n successive Os followed by a 1. Instead of trans- 
mitting these strings of n + 1 digits, the encoder represents the successive values of 


n as k-digit binary codewords. A single codeword can represent any value of n in the 
range 0 =n € m — 1, where 


m=2* —1 


When a run has n = m, the encoder sends a k-digit word consisting entirely of 1s, 
which tells the decoder to wait for the next codeword to determine n. Table 16.1-4 
summarizes the encoding and decoding algorithm. 


Table 16.1-4 Run encoding algorithm 





n Encoder Output (k digits/codeword) Decoder Output (5 m digits/word) 
0 00... 000 1 
1 00...001 01 
2 00...010 001 
m-1 11...110 000...01 


=m Ildul 000... 00 
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Efficient transmission results if, on the average, the codewords have fewer dig- 
its than the runs they represent. In particular, let E be the average number of error 
digits per run and let N be the average number of codeword digits per run, so the 
encoded digit rate is r, = (N/E)r. If N/E < 1 then r, < r and we say that predic- 
tive run encoding achieves rate reduction or data compression. The compression 
ratio N/E is found from the following calculations. 

First, note that a run consists of E = n + 1 error digits, including the 1 at the 
end, so the probability of a run of n is 


P(n) = p'(1 — p) 


Hence, 


Ms 


E= (n + 1)P(n) = (1 — p) Sle + 1l)p" = or [19] 


il 


0 


n 


where we've evaluated the summation using the series expansion 


M8 


(1 =v)? = 14+ 20+ 3^ --- — V (n 1)v" [20] 


x 
y 
e 


which holds when v? < 1. 
Next, since long runs must be represented by more than one k-digit codeword, 
we write 


N — Lk L=1,2,3,... 


where L is related to n via (L — 1)m <n < Lm — 1. The average number of code 
digits per run is then given by 


m-i 2m-1 m-1 eo 
Ak m Pu 2k 2 Pop pe Ms »( SP") > (+ 1")! 
n= n=m n=0 i=0 
The first summation is a standard form and the second can be evaluated using 


Eq. (20), with the final result that 


T p^'-1 , k k 
N = k(1— p)——-(1- p")? = = 





{21] 


Since N depends on k and p, there is an optimum value of k that minimizes N for a 
given predictor. 

Table 16.1~5 lists a few values of p and the corresponding optimum values of 
k. Also listed are the resulting values of N, E, and the compression ratio 
N/E = r,/r. This table shows that predictive run encoding achieves substantial 
compression when we know enough about the source to build a predictor with 
p> 058. 
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Table 16.1-5 Performance of predictive run encoding 

p kot N E N/E 
0.6 1 2.50 2:5 1.00 
0.8 3 3.80 5.0 0.76 
0.9 4 5.04 10.0 0.50 
0.95 6 6.25 20.0 0.31 








In facsimile transmission a text is scanned and sample values are converted to Os 
and 1s for white and black, respectively. Since there are usually many more Os 
than 1s, the source clearly has memory. Suppose a predictor can be built with 


p = 0.9. Estimate an upper bound on the source entropy less than log 2 = 
1 bit/sample. 


EXERCISE 16.1-3 








1 16.2 INFORMATION TRANSMISSION ON DISCRETE 
: CHANNELS 


This section applies information theory to the study of information transmission. 
We'll assume that both the source and the transmission channel are discrete, so we 
can measure the amount of information transferred and define the channel capacity. 
Shannon asserted that, with appropriate coding, nearly errorless information trans- 
mission is possible on a noisy channel if the rate does not exceed the channel capac- 
ity. This fundamental theorem will be examined in conjunction with the binary 
symmetric channel, an important channel model for the analysis of digital commu- 
nication systems. 


Mutual Information 


Consider the information transmission system represented by Fig. 16.2-1. A dis- 
crete source selects symbols from an alphabet X for transmission over the channel. 
Ideally, the transmission channel should reproduce at the destination the symbols 






Source dne 
X 'Transmission 


channel 


Destination 
Y 








Figure 16.2-1 Discrete information transmission system. 
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emitted by the source. However, noise and other transmission impairments alter the 
source symbols, resulting in a different symbol alphabet Y at the destination. We 
want to measure the information transferred in this case. 

Several types of symbol probabilities will be needed to deal with the two alpha- 
bets here, and we'll use the notation defined as follows: 


e  P(x;) is the probability that the source selects symbol x; for transmission; 
.  P(y,) is the probability that symbol y; is received at the destination; 
e P(x y) is the joint probability that x, is transmitted and y; is received; 


.  P(x;|y;) is the conditional probability that x; was transmitted given that yj is 
received; 


e  P(yj|xj) is the conditional probability that y; is received given that x; was 
transmitted. i 


We’ll assume, for simplicity, that the channel is time-invariant and memoryless, so 
the conditional probabilities are independent of time and previous symbol transmis- 
sions. The conditional probabilities P(y;| xj) then have special significance as the 
channel's forward transition probabilities. 

By way of example, Fig. 16.2-2 depicts the forward transitions for a noisy 
channel with two source symbols and three destination symbols. If this system is 
intended to deliver y; = y, when x; = x, and y; = y, when x; = x>, then the symbol 
error probabilities are given by P(y;| x;) forj # i. 

Our quantitative description of information transfer on a discrete memoryless 
channel begins with the mutual information 

^ P(x; | yj) : 
Kx; yj) = log P(x) bits [1] 
which measures the amount of information transferred when x; is transmitted and y; 
is received. To lend support to this definition, we'll look at two extreme cases. On 
the one hand, suppose we happen to have an ideal noiseless channel such that each 
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Figure 16.2-2 Forward transition probabilities for a noisy discrete channel. 
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y; uniquely identifies a particular x,; then P(x;| yj) = 1 and I(x; yj) = log [1/P(x;)] 
—so the transferred information equals the self-information of x; On the other 
hand, suppose the channel noise has such a large effect that y; is totally unrelated to 
x; then P(x |y) = P(x) and I(x; y) = log 1 = 0—so no information is trans- 
ferred. These extreme cases make sense intuitively. 

Most transmission channels fall somewhere between the extremes of perfect 
transfer and zero transfer. To analyze the general case, we define the average 
mutual information 


P(x; | yj) 


EEG bits/symbol [2] 


^ 
IX Y) = D> Pya yj) a > P(xuy) log 
»y »y 
where the summation subscripts indicate that the statistical average is taken over 
both alphabets. The quantity I(X; Y) represents the average amount of source infor- 
mation gained per received symbol, as distinguished from the average information 
per source symbol represented by the source entropy H(X). 


Several different but equivalent expressions for the mutual information can be 
derived using the probability relationships 


P(xiy;) = Pla: | y)Pyj) = PO | x)Px) [3a] 


P(x;) = D Py) Po) XP») [3b] 


In particular, let's expand Eq. (2) as 


i 1 
iy) 2; P6) BE 2, Po) leg P(x; | y) 


xy 


so the first term simplifies to 


1 1 
> X6) log P) = 2 PG) log PG) - H(X) 








Hence, 
KX; Y) = H(X) — H(X|Y) [4] 


where we’ ve introduced the equivocation 


P(x; | yj) 
Equation (4) says that the average information transfer per symbol equals the source 
entropy minus the equivocation. Correspondingly, the equivocation represents the 
information lost in the noisy channel. 

For another perspective on information transfer, we return to Eq. (2) and note 
from Eq. (3a) that P(x | y))/P(x;) = P(y;| x)/P(yj) so KX; Y) = KY; X). Therefore, 
upon interchanging X and Y in Eq. (4), we have 


H(X | Y) = YP(x»j) log [5] 
x,y 
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KX; Y) = H(Y) - H(Y | X) i 
with 
HY) = X PO) AT HO My m ni) oe FL E 


Equation (6) says that the information transferred equals the destination entropy 
H(Y) minus the noise entropy H(Y | X) added by the channel. The interpretation of 
H(Y | X) as noise entropy follows from our previous observation that the set of for- 
ward transition probabilities P(y; | x;) includes the symbol error probabilities. 





EXAMPLE 16.2-1 The binary symmetric channel 
Figure 16.2—3 depicts the model of a binary symmetric channel (BSC). There are 
two source symbols with probabilities 


P) =p Pm) =1-p 
and two destination symbols with forward transition probabilities 
PO | x) = Pd x) =e P(y | x) = Pl x2) =1 -a 


This model could represent any binary transmission system in which errors are sta- 
tistically independent and the error probabilities are the same for both symbols, so 
the average error probability is 


P, = P(x)P(y2 | x1) + P(%2)P(1 | x2) = pa + (1 - pja =a 


Since we know the forward transition probabilities, we'll use I(X;Y) = 
H(Y) — H(Y|X) to calculate the mutual information in terms of p and a. 

The destination entropy H(Y) is easily found by treating the output of the chan- 
nel as a binary source with symbol probabilities P(y,) and P(y;) = 1 — Pi). We 
therefore write H(Y) = OQ[P(y,)] where {( ) is the binary entropy function defined 
in Eq. (9), Sect. 16.1 (p. 703), and 
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Figure 16.2-3 Binary symmetric channel (BSC). 
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obtained with the help of Eqs. (3a) and (35). For the noise entropy H(Y | X), we sub- 
stitute P(x; y;) = P(y; | x;)P(x;) into Eq. (7) to get 


NES] x;) | 


which then reduces to H(Y | X) = O(a). The channel’s symmetry causes this result 
to be independent of p. 


Putting the foregoing expressions together, we finally have 
KX; Y) = Qla + p — 2ap) — O(a) [8] 


so the information transfer over a BSC depends on both the error probability œ and 
the source probability p. If the noise is small, then a <1 and KX; Y) ~ 
O(p) = H(X); if the noise is very large, then a = 1/2 and K(X; Y) = 0. 


H(Y| X) = S Px) 2 PC (y; | xj) log 





Confirm that H(Y | X) = O(a) for a BSC. EXERCISE 16.2-1 





Consider a channel with the property that x; and y; are statistically independent for EXERCISE 16.2-2 
all i and j. Show that H(X| Y) = H(X) and I(X; Y) = 0. 





Discrete Channel Capacity 


We've seen that discrete memoryless channels transfer a definite amount of infor- 
mation J(X; Y), despite corrupting noise. A given channel usually has fixed source 
and destination alphabets and fixed forward transition probabilities, so the only 
variable quantities in /(X; Y) are the source probabilities P(x;). Consequently, 
maximum information transfer requires specific source statistics—obtained, per- 
haps, through source encoding. Let the resulting maximum value of I(X; Y) be 
denoted by 


C, E mos K(X; Y) bits/symbol [9] 
P(x; 


This quantity represents the maximum amount of information transferred per chan- 
nel symbol (on the average) and is called the channel capacity. We also measure 
capacity in terms of information rate. Specifically, if s stands for the maximum sym- 
bol rate allowed by the channel, then the capacity per unit time is 


C= sc, bits/sec [10] 


which represents the maximum rate of information transfer. 
The significance of channel capacity becomes most evident in the light of Shan- 
non’s fundamental theorem for a noisy channel, stated as follows: 
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Figure 16.2-4 Encoding system for a noiseless discrete channel. 








A. general proof of this theorem goes well beyond our scope, but we'll attempt to 
make it plausible by considering two particular cases. 
First, suppose we have an ideal noiseless channel with u = 2" symbols. Then 
I(X; Y) = H(X), which is maximized if P(x;) = 1/y for all i. Thus, 
Ce HOO SIDES Y C — sv [11] 
: P(x; 
Errorless transmission rests in this case on the fact that the channel is noiseless. How- 
ever, we still need a coding system like the one diagrammed in Fig. 16.2-4 to match 
the source and channel. The binary source encoder generates binary digits at the rate 


r, = R for conversion to u-ary channel symbols at the rate s = r,/log u = rj/v. 
Hence, 


RzrQ-—sv-C bits/sec 


and optimum source encoding achieves maximum information transfer with 
R = r, = C. Transmission at R > C would require a coding system that violates 
the Kraft inequality; consequently, decoding errors would occur even though the 
channel is noiseless. 

A more realistic case, including channel noise, is the binary symmetric chan- 
nel from Example 16.2-1 (p. 716). We previously found that K(X; Y) = 
O(a + p — 2ap) — O(a), with O(a) being constant for a fixed error probability 
a. But O(a + p — 2ap) varies with the source probability p and reaches a maxi- 
mum value of unity when a + p — 2ap = 1/2, which is satisfied for any a if 
p — 1/2, that is, equally likely binary input symbols. Thus, the capacity of a BSC is 


C, = 1 - Xa) t2] 


The plot of C, versus o in Fig. 16.2—5 shows that C, ~ 1 fora << 1, but the capac- 
ity rapidly drops to zero as œ — 0.5. The same curve applies for 0.5 = a s lifyou 
replace a with 1 — o, equivalent to interchanging the two output symbols. 
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0 0.5 
Figure 16.2-5 Capacity of a BSC. 


Reliable transmission on a BSC requires channel coding for error control in 
addition to source coding. Our study of error-control coding in Chap. 13 demon- 
strated that the error probability per binary message digit could, in fact, be made 
much smaller than the transmission error probability a. For instance, the (15, 11) 
Hamming code has code rate R, = 11/15 and output error probability Pj, ^ 14a* 
per message digit; if the BSC has a = 107? and symbol rate s, then the Hamming 
code yields P,, = 107° at the message digit rate r, = R,s = 0.73s. Shannon’s theo- 
rem asserts that a better coding system would yield virtually errorless transmission 
at the rate r, = R= C = s(1 — O(a)] ~ s. We'll conclude this section with an 


outline of the argument Shannon used to prove the existence of such a coding sys- 
tem for the BSC. 


Coding for the Binary Symmetric Channel * 


Consider the encoding system diagrammed in Fig. 16.2-6. The BSC has symbol rate 
s and capacity C, = 1 — O(a), with a < 1/2. The source produces information at 
rate R. To maximize the mutual information, an optimum binary source encoder 
operates on the source symbols and emits equiprobable binary digits. Channel cod- 
ing for error control is then carried out in two steps. First, a converter transforms 


Ex to BS d BSC 
MAA 3 =s- a) 


5 128 M 
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Binary 
source 
encoder 


Source 








Figure 16.2-6 Encoding system for a BSC. 
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blocks of binary digits into equiprobably M-ary symbols conveying log M bits/sym- 
bol. Second, the BSC encoder represents each M-ary symbol by a channel codeword 
consisting of N binary channel symbols, hereafter called binits. (Don't confuse the 
channel wordlength N with the average code length N at the output of the source 
encoder.) 

The average information per binit is (log M)/N, and binits are generated at the 
channel symbol rate s. Hence, we can express the source information rate as 


R = s(log M)/N 


where M and N are parameters of the channel encoder. Shannon's theorem requires 


R x C = sC,, equivalent here to (log M)/N =< C. The parameters M and N must 
therefore be related by 


acia os [13] 


with 0 =s e < C,. We'll show that € can be arbitrarily small, so R — C,, and that 
appropriate channel coding makes it possible to recover the M-ary symbols at the desti- 
nation with arbitrarily low probability of error—providing that the wordlength N is very 
large. In fact, we'll eventually let N — co to ensure errorless information transfer. Ideal 
channel coding for the BSC thus involves infinite time delay. Practical coding systems 
with finite time delay and finite wordlength will fall short of ideal performance. 

The reasoning behind large wordlength for error control comes from the vector pic- 
ture of binary codewords introduced in Sect. 13.1. Specifically, recall that all of 2" pos- 
sible words of length N can be visualized as vectors in an N-dimensional space where 
distance is measured in terms of Hamming distance. Let the vector V in Fig. 16.2-7 rep- 
resent one of the M channel codewords, and let V' be the received codeword with n 
erroneous binits caused by transmission errors. The Hamming distance between V' and 
V equals n, a random variable ranging from 0 to N. However, when N is very large, V’ 
almost always falls within a Hamming sphere of radius d < N around the tip of V. The 
decoder correctly decides that V was transmitted if V' is received within the sphere and 
if none of the remaining M — 1 code vectors fall within this sphere. 

But Shannon did not propose an explicit algorithm for codeword selection. 
Instead, he said that randomly selected codewords would satisfy the requirement 





Figure 16.2-7 Vector representation of codewords. 
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when N — oo. Although the random coding approach met with considerable criti- 
cism at first, it has subsequently been recognized as a powerful method in coding 
and information theory. We'll adopt random codeword selection and we'll write the 
probability of a decoding error as 


TS cu S 


Here, P,, is the probability of a noise error, so V’ falls outside the Hamming sphere, 
and P,, is the probability of a “code error" in the sense that two or more selected 
codewords fall within the same Hamming sphere. 

A noise error corresponds to the event n = d. Since transmission errors are sta- 
tistically independent and occur with probability a < 1/2, n is governed by the 
binomial distribution (p. 338) with mean and variance 


n = Na ? = Na(1 — a) 


If we take the Hamming radius as 





d= NB a < p= 1/2 [14a] 
then Chebeyshev's inequality yields the upper bound 
c \ all-a) 
Pae = P(n = d) S =] SS 14b 
ne (n ) (- a, =) N(B m a)? [ ] 


Thus, for any B > a, the probability of a noise error becomes vanishingly small as 
N — oo. 

To formulate the probability of a "code error,” let the vector V in Fig. 16.2-7 
represent one of the selected codewords, and let m denote the number of vectors 
within the Hamming sphere. The remaining M — 1 codewords are chosen randomly 
from the entire set of 2" vectors, and the probability of selecting one of the m vec- 
tors simply equals m/2". Hence, 


P, = (M — 1)m2^" < Mm2^" = m2 "92-4 [15] 


where we've inserted Eq. (13) for M and written C, = 1 — O(a). Now we need an 
upper bound on m as N — co. 

All of the m vectors within a Hamming sphere of radius d represent binary 
words of length N that differ in no more than d places. The number of these words 
that differ in 7 places equals a binomial coefficient, so 


EO (0) 0) 


There are d+ 1 terms in this sum and the last term is the largest since 
d = N/B < N/2; hence, 


eda rr ds: Donc T 
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Then, for the factorial of each of the large numbers N, d = N/B, and 
N — d = N(1 — B), we apply Stirling's approximation 


k! = NV2mkk*e* k>> 1 [16] 
and a few manipulations lead to the upper-bound expression 


mz _ _NB+1 — 2NO(8) [17] 


‘V 2aNB(1 — B) 

Combining Eqs. (15) and (17) then gives 
pou 
"^ — VNB — B) 


which is our final result. 

Equation (18) shows that the probability of a decoding error caused by random 
codeword selection goes to zero as N — oo, providing that e > O(8) — O(a). For 
a given value of the channel parameter a, we can take B > a to satisfy Eq. (14) and 
still have an arbitrarily small value of e  O(B) — O(a). Under these conditions, 
we achieve errorless information transfer on the BSC with R — C as N — oo. 


2.7 Nle* Oa) 9(8)] [18] 





16.3 CONTINUOUS CHANNELS AND SYSTEM 
COMPARISONS 


Having developed concepts of information transmission for the simplified discrete 
case, we're now ready to tackle the more realistic case of a continuous source and 
channel. 

We'll begin with the measure of information from a source that emits a contin- 
uous signal. The material may seem heavy going at first, but we'll then make some 
reasonable assumptions about the transmission of continuous signals to express 
channel capacity in terms of bandwidth and signal-to-noise ratio, a result known as 
the Hartley-Shannon law. 'This result leads us to the definition of an ideal communi- 
cation system, which serves as a standard for system comparisons and a guide for 
the design of improved communication systems. 


Continuous Information 


A continuous information source produces a time-varying signal x(t). We'll treat the 
set of possible signals as an ensemble of waveforms generated by some random 
process, assumed to be ergodic. We'll further assume that tbe process has a finite 
bandwidth, meaning that x(t) is completely characterized in terms of periodic sam- 
ple values. Thus, at any sampling instant, the collection of possible sample values 
constitutes a continuous random variable X described by its probability density 
function p(x). 


EXPONITUR ETOSAN 
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The average amount of information per sample value of x(t) is measured by the 
entropy function 


co 


H(X) Ê | p(x) log di [1] 


where, as before, the logarithmic base is b = 2. This expression has obvious simi- 
larities to the definition of entropy for a discrete source. However, Eq. (1) turns out 
to be a relative measure of information rather than an absolute measure. The 
absolute entropy of a continuous source can, in principle, be defined from the fol- 
lowing limiting operation. 

Let the continuous RV X be approximated by a discrete RV with values x; = i Ax 
for i = 0, +1, £2, ..., and probabilities P(x;) ^ p(x) Ax. Then, based on the for- 
mula for discrete entropy, let the absolute entropy be 


1 
HX) = dm, 2, Pax rv 





- im 2 Eo log = p(x;) log Ax As 


p(x;) 
Passing from summation to integration yields 
Ha(X) = H(X) + H(X) [2a] 
where 
H(X) = -lim log As] p) dx = -= lim log Ax [2b] 


which is the reference for the relative entropy H(X). Since H(X) = —log 0 = œ, 
the absolute entropy of a continuous source is always infinite—a useless but reason- 
able result in view of the fact that X is a continuous RY with an uncountable number 
of possible values. 

Relative entropy, being finite, serves as a useful measure of information from 
continuous sources if you avoid misleading comparisons involving different refer- 
ences. In particular, consider two information signals x(t) and z(t). For a valid com- 
parison. of their entropy functions we write 


Hyps(Z) — Ha(X) = H(Z) - H(X) + [Ho(Z) — Ho(X)] 


where 
H,(Z) — H(X) = -lim log Az + dm log Ax [3] 
: Az| dz 
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If the signals are related in some manner and if |dz/dx| = 1, then the reference 
values are equal and H(Z) and H(X) are directly comparable. 

The inherent nature of relative entropy precludes absolute bounds on H(X). In fact, 
the value of H(X) can be positive, zero, or negative, depending upon the source PDF. 
Nonetheless, a reasonable question to ask is: What p(x) maximizes H(X) for a given 
source? 'This question is significant because a given source usually has specific signal 
constraints, such as fixed peak value or average power, that limit the possible PDFs. 

Stating the problem in more general terms, we seek the function p = p(x) that 
maximizes an integral of the form 


b 
I= | F(x, p) dx 


a 


whose integrand F(x, p) is a specified function of x and p. The variable function p is 
subject to a set of k constraints given by 


b 

| F(x, p) dx = c 

a 
with the c, being constants. A theorem from the calculus of variations says that J is 
maximum (or minimum) when p satisfies the equation 


OF OF, 

op k op 
where the A, are Lagrange’s undetermined multipliers. The values of À, are found 
by substituting the solution of Eq. (4) into the constraint equations. 


In the problem at hand, we wish to maximize 7 = H(X) as defined by Eq. (1). 
Thus, we take 


0 [4] 


F(x, p) = p log (1/p) = -p (In p)/In 2 
Furthermore, p(x) must obey the essential PDF property 





| 594-1 i5 


-00 
so we always have the constraint function and constant 
F(x, p) = p = 1 


Additional constraints come from the particular source limitations, as illustrated in 
the important example that follows. 





EXAMPLE 16.3-1 Source entropy with fixed average power 
Consider the case of a source with fixed average power defined in terms of p(x) by 
sfx = | x? p(x) dx [6] 
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which imposes the additional constraint function F, = x*p and c; = S. Inserting F, 
F, and F, into Eq. (4) yields 


—(Inp +1) +A, + àx? =0 


where In 2 has been absorbed in A, and A,. Thus, 





p(x) z2 euT lgd 
After using Eqs. (5) and (6) to evaluate the multipliers, we get 
l j 
p(x) = LE [7] 
27S 


a gaussian function with zero mean and variance o? = S. 


The corresponding maximum entropy is calculated from Eq. (1) by wnting 
log 1/p(x) = log V 27S + (x?/25) log e. Therefore, 


H(X) = log V27S + 4 log e =} log 2weS [8] 


obtained with the help of Eqs. (5) and (6). Note that this relative entropy has a neg- 
ative value when 27reS < 1. Even so, for any fixed average power S, we've estab- 
lished the important result that 

H(X) € } log 2veS 


and that the entropy is maximum when p(x) is a zero-mean gaussian PDF. Other 
Source constraints of course lead to different results. 





Suppose a source has a peak-value limitation, such that —M x x(t) = M. (a) By EXERCISE 16.3-1 


finding the PDF that maximizes E(X), show that H(X) = log 2M. (b) Let the 
source waveform be amplified to produce z(t) = Kx(t). Find H(Z) and confirm that 
the absolute entropy remains unchanged even though H(Z) # H(X). 





Continuous Channel Capacity 


Information transfer on a continuous channel takes the form of signal transmission. 
The source emits a signal x(t) which, after corruption by transmission noise, 
appears at the destination as another signal y(t). The average mutual information 
is defined by analogy with the discrete case to be 


I(X; Y) S | | Pxy(x, y) log ————— dx dy [9] 


where p y(x) is the source PDF, pxy(x, y) is the joint PDF, and so on. Averaging with 
respect to both X and Y removes the potential ambiguities of relative entropy. Thus, 
I(X; Y) measures the absolute information transfer per sample values of y(t) at the 
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destination. It can be shown from Eq. (9) that (X; Y) = 0 and that 1(X; Y) = 0 
when the noise is so great that y(t) is unrelated to x(t). 

Usually, we know the forward transition PDF py(y | x) rather than py(x | y). We 
then calculate /(X; Y) from the equivalent expression 


KX; Y) = HY) - H(Y | X) [10] 


in which H(Y) is the destination entropy and H(Y | X) is the noise entropy given by 


dx dy 


& |x) = | | moro log — — 
a Py(y | x) 
If the channel has independent additive noise such that y(t) = x(t) + n(t), then 


Dy(y|x) = py(x + n) = py(y — x), where py(n) is the noise PDF. Consequently, 
H(Y | X) reduces to 





HQ | X) = | mie) log — 


dn 
EUN Py(n) Ru 


independent of px(x). 


Now consider a channel with fixed forward transition PDF, so the maximum 
information transfer per sample value of y(t) is 


C, 2 maz KX; Y) bits/sample [12] 
Px 
If the channel also has a fixed bandwidth B, then y(t) is a bandlimited signal com- 
pletely defined by sample values taken at the Nyquist rate f, = 2B-—identical to the 
maximum allowable signaling rate for a given bandwidth B. (Samples taken at 
greater than the Nyquist rate would not be independent and carry no additional 
information.) The maximum rate of information transfer then becomes 


C S 2BC, bits/sec [13] 


which defines the capacity of a bandlimited continuous channel. Shannon's funda- 
mental theorem for a noisy channel applies here in the sense that errorless transmis- 
sion is theoretically possible at any information rate R = C. 

The continuous channel model of greatest interest is known as the additive 
white gaussian noise (AWGN) channel, defined by the following properties: 


1. The channel provides distortion-free transmission over some bandwidth B, and 
any transmission loss is compensated by amplification. 


2. The channel constrains the input from the source to be a bandlimited signal x(t) 
with fixed average power S = x? 


3. The signal received at the destination is contaminated by the addition of band- 


limited white gaussian noise n(t) with zero mean and average noise power 
N =r = NS. 


4. The signal and noise are independent so that y(t) = x(t) + n(t) and 


EEEE A A ET ey 
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These properties agree with the assumptions made in earlier chapters to obtain 
upper bounds on the performance of analog systems. Here, we'll determine the cor- 
responding channel capacity. 

First, we find the noise entropy H(Y | X) via Eq. (11). Since py(n) is a zero- 
mean gaussian function with variance o? = N, it follows from Eqs. (7) and (8) that 
H(Y | X) = $ log 2«eN. Second, noting that H(Y | X) does not depend on the source 
PDF, we use Eqs. (10) and (12) to write 


C, = max[A(Y) — H(Y|X)] = | max Ho | — 4 log 27eN 
Px(x) Px(x) 

But the destination signal y(t) has fixed average power y^-S-N, so 

H(Y) = į log 2re(S + N). If py(x) is a zero-mean gaussian function, then 

y = x + nhas a gaussian PDF and H(Y) is maximized. Hence, 





S+N 
C, = } log 2re(S + N) ~ 1og 2reN = } log ( m ) 


Finally, substituting for C, in Eq. (13) yields the simple result 
C = B log (1 + S/N) [14] 


where we recognize S/N as the signal-to-noise ratio at the destination. 

Equation (14) is the famous Hartley-Shannon law. When coupled with the 
fundamental theorem, it establishes an upper limit for reliable information transmis- 
sion on a bandlimited AWGN channel, namely, R = Blog (1 + S/N) bits/sec. 
Additionally, since bandwidth and signal-to-noise ratio are basic transmission 
parameters, Eq. (14) establishes a general performance standard for the comparison 
of communication systems. We therefore devote the remainder of this section to 
implications of the Hartley-Shannon law. 


Ideal Communication Systems 


Virtually every realistic communication system is capable of transmitting continu- 
ous signals, subject to some power and bandwidth limitations. Furthermore, if we 
exclude radio transmission with fading, careful system design can largely eliminate 
all other contaminations except unavoidable thermal noise. The AWGN channel 
thus serves as a reasonable model under these conditions, and the Hartley-Shannon 
law gives the maximum rate for reliable communication. Therefore, we here define 





Shannon described the operation of such an ideal system as follows. 
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T x(t) y(t) Destination 
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signals copies 
Figure 16.3-1 Ideal communication system with AWGN channel. 


The ideal system has the structure diagrammed in Fig. 16.3-1. Information 
from the source is observed for intervals of 7 seconds and encoded as equiprobable 
M-ary symbols, such that (log M)/T = R. For each symbol, the signal generator 
emits a unique waveform x(t) selected from a set of M = 2°" signals. These signals 
are generated by a bandlimited white gaussian process, and stored copies are avail- 
able at the destination. The signal detector compares the received signal 
y(t) = x(t) + n(t) with the stored copies and chooses the most probable one for 
decoding. The operations of encoding at the source and signal detection at the desti- 
nation result in a total delay of 2T seconds, called the coding delay. 


The number of different signals required depends on the desired error probabil- 
ity P,, and Shannon showed that 


. . log M S 
lm lim = Blog) 1 t — 
Pœ Too T N 





Hence, R — C in the limit as T — co— which means that the number of signals and 
the coding delay becomes infinitely large, so the ideal system is physically unrealiz- 
able. However, real systems having large but finite M and T can be designed to come 
as close as we wish to ideal performance. (One such system is described in Sect. 16.5.) 
With this thought in mind, let's further consider the properties of a hypothetical ideal 
system. 

The capacity relation C = Blog (1 + S/N) underscores the fundamental role 
of bandwidth and signal-to-noise ratio in communication. It also shows that we can 
exchange increased bandwidth for decreased signal power, a feature previously 
Observed in wideband noise-reduction systems such as FM and PCM. The Hartley- 
Shannon law specifies the optimum bandwidth-power exchange and, moreover, sug- 
gests the possibility of bandwidth compression. 

Keeping in mind that noise power varies with bandwidth as N = NoB, we 
explore the trade-off between bandwidth and signal power by writing 


S 
C= Bios(1 +) 15] 
NoB 


TS 
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Figure 16.3-2 Trade-off between bandwidth and signal power with an ideal system. 


Thus, if Ny and R have fixed values, information transmission at the rate R= C 
requires 


EN > B R/B — ] 

Ud (2 1) [16] 
which becomes an equality when R = C. Figure 16.3—2 shows the resulting plot of 
S/NoR in dB versus B/R. The region on the lower left corresponds to R > C, a for- 
bidden condition for reliable communication. This plot reveals that bandwidth 
compression (B/R < 1) demands a dramatic increase of signal power, while band- 
width expansion (B/R > 1) reduces S/NoR asymptotically toward a distinct limit- 
ing value of about —1.6 dB as B/R — co. 

In fact, an ideal system with infinite bandwidth has finite channel capacity given by 





C, = lim B tos( 1 eee ) ge A [7] 
B—oo NoB No In 2 No 
Equation (17) is derived from Eq. (15) written in the form 
sh (1 + A) S 
NA  1n2 NB 


The series expansion In (1 + A) = A — 1A? + + then shows that [In (1 + A)]/A— 1 
as A — 0, corresponding to B > oo. Note that C,, is the maximum capacity for 
fixed S and N,, so 


S/NoR = S/NSC,, = 1n 2 = —1.6 dB 
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From the shape of the curve in Fig. 16.3-2, we conclude that C ~ C,, for B/R > 10. 

It must be stressed that these results pertain to an ideal but unrealizable system. 
Consequently, they may be used to estimate upper limits on the performance of any 
real system whose transmission channel approximates the AWGN model. An exam- 
ple should help illustrate the value of such estimates. 


TV pictures from Mars 


A roving vehicle equipped with a monochrome TV camera is proposed to continue 
the exploration of the surface of Mars. The TV pictures will be digitized for trans- 
mission back to Earth. We want to estimate the time required to transmit one picture, 
given the following information. 

A digitized picture will consist of n, = 400 x 300 pixels (picture elements), 
each pixel having one of 16 possible brightness levels. Hence, the information rate R 
and picture transmission time T are related by 


RT = n, log 16 = 480,000 bits 


assuming equally likely brightness levels. 

The Mars-to-Earth link is a microwave radio system with carrier frequency 
f. ^ 2 GHz and path length £ = 3 x 10? km. The transmitter on the vehicle deliv- 
ers Sr = 20 W signal power to a dish antenna one meter in diameter. The Earth sta- 
tion has a 30-m dish antenna and a low-noise receiver with noise temperature 
Ty = 58 K. The signal power S at the receiver is calculated using L = 268 dB, 
gr = 26 dB, and gp = 56 dB, so 


s= SEs, = 5 x 10718 W 


The noise density at the receiver is 
No EX kT y = 8 X 1072 W/Hz 


obtained from Eq. (6), Sect. 9.4. 
Since no transmission bandwidth was specified, let's assume that B/R > 10. 
An ideal system would then have 


R = C = C = 1445/N, = 9000 bits/sec 


and the corresponding bandwidth must be B > 10R = 90 kHz. Therefore, the 
transmission time per picture is 


480,000 bits 
=> 


= 9000 bits/sec 7 59S 


A real system, of course, would require more time for picture transmission. The 
point here is that no system with the same assumed specifications can achieve a 
smaller transmission time—unless some predictive source encoding reduces RT. 
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EXERCISE 16.3-2 


A keyboard machine with 64 different symbols is connected to a voice telephone 
channelhaving B — 3 kHz and S/N9B — 30 dB. (a) Calculate the maximum pos- 
sible symbol rate for errorless transmission. (b) Assuming B can be changed 
while other parameters are fixed, find the symbol rate with B = 1 kHz and B — oo. 





System Comparisons 


Having determined the properties of an ideal communication system, we're ready to 
reexamine various systems from previous chapters and compare their performance 
in the light of information theory. Such comparisons provide important guidelines 
for the development of new or improved communication systems, both digital and 
analog. For all comparisons, we'll assume an AWGN channel with transmission 
bandwidth B7 and signal-to-noise ratio (S/N)s = Sy/NoB; at the receiver. 

Consider first the case of binary baseband transmission. Previously, we found 
the maximum signaling rate 2B; and minimum transmission error probability 
ol V (S/N) a which requires the use of polar sinc-pulse signals and matched filter- 
ing. Now we'll view this system as a binary symmetric channel with symbol rate 
s = 2B, and transmission error probability a = ol V (S/N) zi Hence, the BSC sys- 
tem capacity is 


C-2B,1-0(2]  «-Q[V(S/N).] [18] 


where Q(æ) is the binary entropy function from Eq. (9), Sect. 16.1. With sufficiently 
elaborate error-control coding, we can obtain nearly error-free information transfer 
at R =C. 

Figure 16.3—3 shows C/Bry versus (S/N), for a BSC and for an ideal continuous 
system with C/B;, = log [1 + (S/N)4]. The BSC curve flattens off at C/B; = 2 as 
(S/N)a — œ because the maximum entropy of a noise-free binary signal is one bit 
per symbol. Hence, we must use M-ary signaling to get closer to ideal performance 
when (S/N), >> 1. At lower signal-to-noise ratios, where C/B; < 2, the gap 
between the BSC and ideal curves suggests that there should be a way of extracting 
more information from a noisy binary signal. This observation has led to the sophis- 
ticated technique known as soft-decision decoding in which binary codewords are 
decoded from sample values of the continuous signal-plus-noise waveform, rather 
than from a regenerated (hard-decision) binary waveform. 

Now let's examine the performance of M-ary digital communication systems 
without error-control coding. We'll assume a binary source that emits equiprobable 
symbols at rate r,. If the error probability is small, say P,, = 1074, then the infor- 
mation rate is R = r, since almost all of the binary digits correctly convey informa- 
tion to the destination. For transmission purposes, let the binary data be converted to 
a Gray-code M-ary signal with M = 2*. An M-ary baseband system with polar sinc 
pulses and matched filtering then has 


She up -*(1- ijo SK [19] 
B, w~ K M M1 


where y, = E,/No and E, = Sp/r,, which is the average energy per binary digit. 
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Figure 16.3-3 


Upon solving Eq. (19) for the value of y, that gives Pj, ^ 1074, we obtain the 
plot of r,/B4 versus y, in Fig. 16.3—4. This same curve also applies for bandpass 
transmission via APK modulation with M replaced by VM , while other modulation 
methods have poorer performance. The comparison curve for an ideal system is cal- 
culated from Eq. (16) taking R = r, and S/NoR = Sp/Nor, = Yp. We thus see that 
real digital systems with r,/B; = 2 and small but nonzero error probability require 
at least 6—7 dB more signal energy than an ideal system. Error-control coding would 
reduce the energy needed for a specified error probability, thereby shifting the sys- 
tem performance curve closer to the ideal. 

Finally, we come to analog communication in which bandwidth and signal- 
to-noise ratio are the crucial performance factors, rather than information rate 
and error probability. Consider, therefore, the analog modulation system repre- 
sented by Fig. 16.3—5, where the analog signal at the destination has bandwidth 
W and signal-to-noise ratio (S/N)p. The maximum output information rate is 
R = Wlos (1 + (S/N)5], and it cannot exceed the transmission channel capacity 
C = Brlog [1 + (Sp/NpBr) |. Setting R = C and solving for (S/N), yields 


S Bis a ( 2 
—] s{1+ -1l=(14+- —1 [20] 
Cr. 


where the normalized parameters b and y are defined by 
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Figure 16.3-4 Performance comparison of ideal system and M-ary system with Pj, = 1074. 
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C = Brlog ( o ) Demod 
NoBr R = Wlog [1 + (S/N)p] 


Figure 16.3-5 Analog modulation system. 


Equation (20) becomes an equality for an ideal system operating at full capacity, so 
we have a standard for the comparison of all other analog communication systems. 
In particular, we'll compare the analog performance curves given by (S/N) versus 
y with fixed bandwidth ratio b, and the power-bandwidth exchange given by y ver- 
sus b with fixed (S/N)p. 

Figure 16.3-6 repeats some of our previous analog performance curves for 
b — 6, together with the curve for an ideal system. The heavy dots mark the thresh- 
old points, and we see that wideband noise-reduction systems (FM, PCM, and PPM) 
fall short of ideal performance primarily as a result of threshold limitations. Thresh- 
old extension techniques may improve performance by a few dB, but threshold 
effect must always occur in wideband analog modulation; otherwise, the curves 
would cross into the forbidden region to the left. 

Figure 16.3—7 depicts the power-bandwidth exchange of analog modulation sys- 
tems, taking (S/N)p = 50 dB and S, = x? = 1/2. SSB and DSB appear here as single 
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Figure 16.3-6 Performance comparison of analog modulation systems. 
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Figure 16.3-7 


Power-bandwidth exchange of analog modulation systems with (S/N)p = 50 dB. 








16.4 Signal Space 


points since b is fixed, and SSB is equivalent to an ideal system with b = 1— meaning 
no wideband improvement. The FM and PPM curves are calculated from previous 


equations, and the PCM curve corresponds to operation just above threshold. Observe . 


that PCM comes closest to the ideal when we want b > 1 for wideband improvement. 

Based on these two comparison figures, we conclude that digital transmission 
via PCM makes better use of the capacity of a continuous channel than conventional 
analog modulation. This difference between digital and analog transmission of ana- 
log information stems from the fundamental fact that transmission noise prevents 
exact reproduction of an analog waveform. The PCM quantization process takes 
advantage of this fact and provides a better match of signal to channel than does 
analog modulation. 

Shannon went further in this direction by proposing that tbe information rate of 
a continuous source be measured in terms of some specified fidelity or distortion cri- 
terion. Appropriate digital source coding then allows efficient information transmis- 
sion over a channel whose capacity equals the rate in question. Shannon's proposal 
has subsequently been developed into the field known as rate distortion theory. An 
introduction to this theory is given by Proakis and Salehi (1994, Chap. 4). 


16.4 SIGNAL SPACE 


As we have seen, digital transmission is the most efficient form of communication 
in the light of information theory. Thus we should give some attention to the theory 
of optimum digital detection. First, however, we need a method of representing sig- 
nals, which is the purpose of this section. In particular, we'll introduce the important 
technique of representing signals as vectors, a technique originated by Kotel’ nikov 
in his theory of optimum noise immunity. 


Signals as Vectors 


A signal space is a multidimensional space wherein vectors represent a specified set 
of signals. By describing signals as vectors, the familiar relations and insights of 
geometry can be brought to bear on signal analysis and communication system 
design. Here we introduce those aspects of signal-space theory relevant to digital 
detection, restricting our attention to the case of real energy signals. 

Consider any two real energy signals u(t) and w(t). If œ and f are finite real 
constants, then the linear combination z(t) = av(t) + Bw(t) is also an energy sig- 
nal. Formally, we say that the set of all real energy signals is closed under linear 
combination, which is the essential condition for establishing a linear space. We 
thus define the signal space I to be a linear space wherein vectors such as v and w 
represent energy signals u(t) and w(t), and any linear combination such as 
z = av + Bw is another vector in I. 

Vector addition and scalar multiplication in I’ obey the usual rules, and there 
exists a zero element such that v + 0 = v and v — v = 0. However, to complete 
the geometric structure of I, we need appropriate measures of vector length and 
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Figure 16.4-1 


angle expressed in terms of signal properties. (We can’t use Hamming distance here 

- because we're not necessarily working with binary sequences.) Since we're dealing 
with energy signals, we'll define the length or norm of a signal vector v to be the 
square root of the signal energy, written as 


lull 2 VE = LE v) a tu 


—oo0 
This vector norm is a nonnegative quantity, and |v| = 0 only if v(t) = 0. We use 


the square root of the signal energy in Eq. (1) so that the norm of the sum 
z(t) = u(t) + w(t) satisfies the triangle inequality 


1 
4 





vi 


lv + wil = lloll + lwll [2] 
Figure 16.4—1 shows the corresponding picture as a triangle composed of the vec- 
tors v, w, and z = v + w. 
To measure the angle of v relative to some other vector w, we first recall that the 
crosscorrelation function measures the similarity between the shifted signal 
v(t + 7) and w(t). Now, setting r = 0, we define the scalar product 


(v, w) = R,,(0) = | v(t)w(t) dt [3] 

which has the properties 
(v, v) = lloll? [4] 
[(, w) | = IHvlllwli [5] 


Equation (5) simply restates Schwarz's inequality, and the equality holds when u(t) 
and w(t) are proportional signals. 


Justification of the scalar product for angular measurement comes from the 
expansion 


lv + wil? = (v + w,u + w) = (u,v) + (vw) + (w, v) + (w, w) 


li 


oll? + 2v, w) + lwli? [6] 


=: 
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Figure 16.4-2 Projection of v on w. 


On the one hand, if v(f) and w(t) are fully correlated in the sense that v(t) = +aw(t), 
then (v, w) = zc |jv|||\wl] and |u + w| = |u|] + |}wl|. Figure 16.4—1 thus reduces to 
colinear vectors, with v parallel to w. On the other hand, if v(t) and w(t) are uncorre- 
lated such that (v, w) = 0, then [v + w|? = [v|? + ||w|? and Fig. 16.4-1 becomes a 
right triangle with perpendicular vectors v and w. Accordingly, we say that 





Orthogonality in signal space implies superposition of energy, since |v|?^ + |w]? = 
E, + Bs 
For the intermediate case of 0 < |(v,w)| < vll w|, we can decompose v 
into the two orthogonal (perpendicular) components shown in Fig. 16.4—2. The pro- 
jection of v on w is defined by the colinear and orthogonal conditions v,, = aw and 
(v — v,, w) = 0, and solving for æ yields 
_ (v. w) 


wS Tw” s 





You can confirm from Schwarz's inequality that |v,| = lv|| as expected. 

Next, as further preparation for the study of optimum detection, suppose we 
have a set of M energy signals denoted by (sj(1); i = 1, 2,..., M). Each of these sig- 
nals can be represented by a vector written in the form 


K 
Si = Anh, t Andy tco aub = S oa, K=M [8a] 
k=1 


where the œ, are vectors and the a, are scalar coefficients calculated from 
o = (Sir Pk) [8b] 


Two important aspects of this representation should be stressed. First, Eq. (8a) is an 
exact expression, rather than a series approximation. Second, each signal is com- 
pletely specified by the scalars in Eq. (8b), so we can work with a set of K coeffi- 
cients oj instead of the entire waveform st). These points will become clearer from 
the signal-space interpretation. 
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The vectors $, in Eq. (8) represent a set of orthonormal basis functions ¢,(t) 
with the properties 


lel? = (be by) =l k= 1,2,...,K [9] 


(dy, $;) =0 TRE 


These basis functions are mutually orthogonal and each one has unit norm—analogous 
to the unit vectors of ordinary two- and three-dimensional spaces. By extrapolation, we 
say that the @, span a K-dimensional subspace Y y of the signal space T. If’, contains 
the signal vector s;, its projection on the basis vector $, is 


(Si bed ll bell? = (si be) be = au, 


where we've used Eqs. (7), (9), and (8b). Therefore, the coefficients œ; are the coor- 
dinates of the vector s; with respect to the $, basis. i 

The subspace I’, and basis $, depend upon the specific set of signals we wish 
to represent. In fact, we usually generate the basis functions directly from the set of 
signals {s,(t)}, thereby ensuring that T x contains all of the signal vectors in question. 
Although a given set of signals defines a unique subspace, we have considerable 
flexibility in the choice of basis functions as long as they span I’, and satisfy Eq. (9). 

Having generated an orthonormal basis for a set of signals, we can compute the 
vector coordinates and express the norms and scalar products in terms of the coordi- 
nates. The scalar product of any two signals is found using Eqs. (8) and (9), with the 
result that 


K 
(si, Sm) E ( > Os > anb ) za 2 ik mk 


Hence, letting m = i we have 


[10a] 


K 
Isl? = (si 5) = >= e [105] 
k= 


which is a K-dimensional statement of the pythagorean theorem. 


The Gram-Schmidt Procedure 


A sequential method for constructing an orthonormal basis is the Gram-Schmidt 
procedure, as follows: 


1. Select from the signal set any s,(t) having nonzero norm, and take $, = sj/|s,|| 
so |g = 1. 

2. Select another signal s(t), compute the coordinate a, = (s2, $), and define 
the auxiliary vector g, = s; — @,, in Fig. 16.4—3a. Since g, is orthogonal to 
$i, we can take $y = &/| ejl. 
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Figure 16.4-3 Vector constructions for the Gram-Schmidt procedure. 


3. Select another signal s;(t), compute the coordinates o4, and a, to define 


83 = 53 — [05,04 + O32] in Fig. 16.4-3b. Then take $3 = 4/|g3l so 3; has 
unit norm and is orthogonal to @, and $». 


4. Continue in this manner until all M signals have been considered. 
The basis resulting from this procedure can be expressed compactly by 
be = 8x/Ilgull [11a] 


with g, — s, and 


k-1 
£ 75,— X ayh 2-ksK [11b] 
j= 


Equation (11b) yields g, = 0 whenever s, has zero norm or equals a linear combina- 


tion of previous signal vectors. Such cases are omitted from the basis, and the 
dimensionality will be K < M. 





three signals in Fig. 16.4—4a. First, noting that ||s,|* = E, = 6°, we take 
p(t) = s,(t)/6 as plotted in Fig. 16.4-4b. Then 


Q5 = | s(t) (t) dt = 3 
—0o 
so g(t) = s(t) — 36,(t), and a sketch of g,(t) quickly yields |g;]? = 3?, and we 
take p(t) = g;(t)/3 in Fig. 16.4—4c. A third basis function is not needed since we 
can write the linear combination s4(f) = s,(t) — s(t). 


By calculating the remaining coordinates, or by inspection of the waveforms, 
we obtain the vector representations 


5, = 69, s = 36, + 36, 53 = 36, — 36, 


which are diagrammed as vectors in Fig. 16.4—4d. This diagram brings out the fact 
that s, and s, are orthogonal and their sum equals s}. 


Let's apply the Gram-Schmidt procedure to obtain an orthonormal basis for the 
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Figure 16.4-4 {a} A signal set; (b} one basis function; (c) another basis function; (d) vector 
diagram. 





EXERCISE 16.4-1 Let another set of signals be defined from Fig. 16.4-4a by taking 
si(t) = VZ st), 3(t) = V2s,(t), and s(t) = —V25,(t). (a) Apply the 
Gram-Schmidt procedure and draw the resulting vector diagram. (b) Calculate 
[55]? using Eq. (10b) and compare with the result from Eq. (1). 











16.5 OPTIMUM DIGITAL DETECTION 


The signal/vector concepts developed in the previous section are used here to design 
optimum receivers for digital signals with AWGN contamination. Vector concepts 
also facilitate the calculation of error probabilities and the selection of signals for 
efficient digital communication. 
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n,(t) 





s(t) 
m —| Transmitter AWGN Receiver 
channel 


Figure 16.5~1 Mary digital communication system. 








Optimum Detection and MAP Receivers 


Consider the M-ary digital communication system in Fig. 16.5—1. The source selects 
a symbol m from the set (mj; i = 1,2,..., M). The transmitter generates a signal 
that appears at the destination as s(t). This signal belongs to the set 
[s(t); i = 1,2,..., M}, in one-to-one correspondence with the symbol set. The 
received signal is corrupted by independent additive white gaussian noise n,(t) with 
zero mean and power spectral density G(f ) = No/2. The receiver produces the esti- 
mated symbol m by comparing y(t) = s(t) + n,(t) with stored copies of the uncor- 
rupted signals. We seek an optimum detection strategy that minimizes the average 
error probability P, = P(m # m). 

To formulate the optimization criterion more precisely, let P(m | y) be the con- 
ditional probability that m was selected by the source given that y(t) has been 
Observed at the destination. Clearly, if 


. P(mj| y) > P(m; | y) allizj [1] 


then the receiver should chose m = m; to minimize P(m # m). The quantities in 
Eq. (1) are known as a posteriori probabilities, meaning “after observation," and 
Eq. (1) defines a maximum a posteriori (MAP) receiver. In the unlikely event of a 
tie, the receiver chooses any one of the m; that maximizes P(m;| y). We'll assume 
equiprobable symbols, in which case the MAP criterion reduces to maximum like- 
lihood detection. 

Our statement of the optimum detection problem contains only two critical 
assumptions: independent additive gaussian noise and stored copies at the 
receiver. The assumption of equiprobable symbols is not critical since this condi- 
tion can be ensured by source encoding, as described in Sect. 16.1. Nor is the 
assumption of a white noise spectrum critical since a nonwhite noise spectrum can 
be equalized by a prewhitening filter at the front end of the receiver. The assump- 
tion that stored copies are available at the receiver essentially means that we're 
working with a coherent system, but we have no preconceptions about the struc- 
ture of the receiver itself. Signal-space analysis eventually determines specific 
receiver implementations. 

All of the M possible signal waveforms will be represented by vectors in a sub- 
space I'« of dimensionality K = M and spanned by some orthonormal basis $,. 
However, the received vector y = s + n, does not fall entirely within Ig. We there- 
fore decompose n, as n, — n + n' shown in Fig. 16.5-2 where, for convenience, 
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Figure 16.5-2 Projection of y = s + n, on Ik. 


Tx is drawn as a plane. The component n’, called the irrelevant noise, is orthogonal 
to I’, in the sense that 


(n,d)-0 k=1,2,...,K 


The remaining noise vector n is contained in l'x, so we can write 


K 
n= > Bx, [2a] 

k=1 

where the noise coordinates f, are given by 


Bx = (n, $4) = (ny TA, dx) = (n,, Px) [2b] 
Finally, we define the relevant data vector 


A f 
v=y-n=st+n [3] 


which is the projection of y on l' since y = v + n' and n’ is orthogonal to every 
vector in I'y. 

Figure 16.5—3 illustrates the vectors in I’, for the case of K = 2 and M = 3. 
The three dots mark the tips of the uncorrupted signal vectors, and v is a typical rel- 
evant data vector when s = s;. Intuitively, this diagram suggests that the receiver 
should choose the signal vector whose tip is closest to the tip of v. Subsequent 
analysis indeed confirms the maximum-likelihood detection rule: 








This rule defines M decision regions in Ix, one region for each signal point. In 
Fig. 16.5—3, for instance, the three decision regions are bounded by dashed lines 
formed by constructing perpendicular bisectors of lines connecting the signal points. 


These decision-region boundaries correspond to the decision thresholds introduced in 
Chap. 11. 
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Figure 16.5-3 Signal vectors and decision regions in Dy with K = 2 and M = 3. 


The foregoing detection rule involves vector norms in Ty. We know the signal 


norms || Si | = VE, but the noise n is a random vector whose properties require fur- 
ther study. From Eq. (2a) and Eq. (10D), Sect. 16.4, we have 
K 
Inl? = SBF 
kzi 


where B, = (n,, $,), which is a linear operation on a gaussian RV. Hence, the 9, are 
also gaussian with mean value 9, = (n,, $,) = 0. We turn then to the variance and 
the question of statistical independence by considering the expectation 


co 


E Bb] = z| | ny(t) db; (t) a| 


-00 —co 


096) da | 


= [ |. EONA di 
A P 
But, from the autocorrelation function of white noise, E[n,(t)n,,(A)] = ES ó(t — A), 
so we get 
Ny2 i=k 
E «| ems 


which means that the 8, are independent gaussian RV's with variance c? = N42. 
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Therefore, the PDF of any noise coordinate 6, = f is the zero-mean gaussian 
function 





eB [20° o? = — [4a] 
To? 2 


P(B) = - 


The joint PDF of all XK coordinates is then p,(B,, Bo- BQ = 


De(Pi)p(B2) > pg(By) since the coordinates are mutually independent. Accord- 
ingly, we express the vector PDF as 


pn) = PalBr Bos --- By) = (NS) F7? e elm [4b] 


which has spherical symmetry in Tx. 

Now we're prepared to analyze optimum detection from the criterion in Eq. (1), 
noting that P(m | y) = P(m |v). Since P(m|v)p,(v) dv = P(m)p,(v | m) dv, we can 
write P(m |v) = P(m)p,(v | m)/p,(v) where the unconditional PDF p,(v) does not 
depend on m. Thus, P(m;| y) > P(m,| y) is equivalent to the vector PDF condition 


P(mj)p.(v | mj) > P(mj)p.(v | mi) [5] 
But when m = m; s = s;jsoU = s; + nand 
p.(v | m;) m D,(U m 5;) T (TN,) *? e o7 


Substituting this expression into Eq. (5) yields, after simplification the MAP detec- 
tion rule 


ilv ben sj? v In P(mj) < lv o sil? — in P(m;) [6a] 


When the symbols are equiprobable, P(m;) — 1/M for all i and Eq. (6a) simplifies 
to the maximum-likelihood detection rule 


lv — sli? < lv — s? [65] 


which is the same as the decision region condition |v — s| < jv — sj]. 

Our next task is to devise a receiver structure that implements the detection rule. 
Recognizing that the receiver must operate on the waveform y(t) instead of the vec- 
tor v, we first write |v — s,|* = (vl? — 2(v, sj) + [sj]? where [sj]? = E; and 
(v, si) = (y — n', sj) = (y, s;) since n’ is orthogonal to any vector in Ix. The term 
lvl]? will appear on both sides of Eq. (6) and cancel out, so we define the observable 
decision function 


ll 


Zi = (y, si) — ci [7a] 


in which 


(5) = | x(t)s(t) dt [7 b] 


~co 
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E; equiprobable m; 
C; — 7d 
2[E; — No In P(m;)| otherwise 


Ni 


The detection rule thus becomes: 











The scalar products (y, s;) needed for optimum detection can be calculated at 
the receiver using the stored signal copies and a bank of correlation detectors, each 
correlator having the form of Fig. 16.5-4a. If the signals are timelimited so that 
s(t) = 0 outside of 0 = t = D, then correlation detection reduces to matched filter- 
ing and sampling as diagrammed in Fig. 16.5—4b. In either case, the operation (y, s;) 
projects y along the direction of s; and strips off the irrelevant noise n’. 

Figure 16.5-5 shows a complete MAP receiver with a bank of correlators or 
matched filters that operate in parallel on y(t) to produce (y, sj), i = 1, 2,..., M. 









y(t) Q, sp 





f SEO hit) = s(D - t) 






sit) 
(a) (b) 


Figure 16.5-4 Scalar product calculation. (a] Correlation detector; (b) matched filter. 
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Figure 16.5-5 MAP receiver with bank of correlators or matched filters. 
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Subtracting the bias terms c; yields the set of decision functions (z;). The receiver 
compares the decision functions to find the largest one, say z;, and chooses the opti- 
mum estimate m = m). 
Various simplifications of this structure are possible, depending upon the spe- 
cific set of signals. When the symbols are equiprobable and all signals have the 
same energy, the bias terms are independent of i and may be omitted so that 
z; = (y, sj). When the signals are such that the dimensionality of Ty is K < M, we 
can use Eq. (8a), Sect. 16.4, to write 


K 
(y, 5) = 2 Gay, bx) [8] 


and the scalar products can be calculated from stored copies of the K basis functions 
p(t) rather than the M signals. Only for this implementation do we need to know 
the basis functions explicitly. Modifications such as these account for the simplified 
receivers shown in previous chapters. 


i 

i 
J 
i 
s 
EI 
3 
4 
a 
E 
4 





EXAMPLE 16.5-1 Suppose a set if M = 8 signals are generated by combined amplitude and phase 
modulation (QAM) such that 


s(t) = —s(t) = A, cos wet 
s(t) = —s(t) = V2 A, cos (wt + 7/4) = A,(cos wt — sin cxt) 
53(t) = —s,{t) = A, cos (wt + 7/2) = ~A, sin et 


sat) = —sg(t) = VIA. cos (wt + 37/4) = —A (cos wt + sin wt) 


All signals have duration D, and the carrier frequency f, is an integer multiple of 
1/D. 

Observing that each signal can be expressed as a linear combination of cos w,t 
and sin w,t, we take the basis functions to be 


h(t) = V2/D cos ot — ét) = — V2/D sin e«t 4 


We thus obtain the two-dimensional signal set in Fig. 16.5-6a, where E, = A2D/2 
and E; = 2E,. Figure 16.5-6b shows the corresponding decision boundaries for 
maximum-likelihood detection. (These diagrams differ from the class of QAM sig- 
nals discussed in Sect. 14.4 which have M = p? points in a square array.) 

Since K « M and the basis functions are known, Eq. (8) leads to the simplified 
optimum receiver portrayed in Fig. 16.5—6c. The computational unit uses the inputs 
(y, $1) and (y, $2) to calculate the decision functions 


Zi — [anly $1) + aly, $5)] SE E/2 1152 2x 
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0, 6) 
— sm 
xD» m 
Q, 2) 
| 28 
Computation Decision 
V2/D sin wt unit unit 
(d 
Figure 16.5-6 (a] A two-dimensional signal set; {b} decision regions; (c) optimum receiver. 


in accordance with Eqs. (7) and (8). When the source produces a sequence of sym- 
bols, the integration and decision process is repeated over each successive symbol 
interval AD = t = (k + 1)D. 


Error Probabilities 


But the structure of an optimum receiver is not much good without knowledge of the 
error probability. So here we formulate general and specific expressions for the 
average error probability in an M-ary system with equiprobable symbols and opti- 
mum detection. 

As a starting point, take an arbitrary symbol m; and let P(e | m;) stand for the 
probability of erroneous detection of the corresponding signal s{t). Averaging over 
the M equiprobable symbols gives 
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M | M 
P, = >\P(m)P(e | m) = — > Ple | m;) [9a] 
j=l M je 
Alternatively, we can write 
1 M 
P,o=1-P, P,-—MWP(c|m) [9b] 


MF 


where P(c | m;) stands for the probability of correct detection. We'll use geometric 
vector arguments to determine P(e | mj) or P(c | mj). 

Figure 16.5—7a represents the situation in lg when s,(t) has been corrupted by 
noise n,(1), resulting in the vector v = 5; n. Also shown is another signal vector s; 
whose tip-to-tip “distance” from s; is dj; = ls; — 5j ||. A detection error occurs if the 
projection of n along the direction of s; — s; exceeds d;/2, so v falls on the wrong side 
of the decision boundary halfway between s; and s;. This error event will be denoted 
by ej. To find the probability P(e,), let the vectors be translated and/or rotated until 
5; — s; becomes colinear with an arbitrary basis vector ¢, as shown in Fig. 16.5—7b. 
Any translation or rotation of signal points and decision boundaries does not effect 
error-probability calculations because p,(n) has spherical symmetry in lg. But now 
we more readily see that P(ejj) equals the probability that B, > d,/2. Hence, 


P(e) = [pea dp, = (5) = TA = (m) [10] 


which follows from Eq. (4a). 

Equation (10) immediately applies to the binary case since there are only two sig- 
nal vectors, which we label s; and s; to be consistent with previous notation. Clearly, 
P(e | mo) = P(eyo) and P(e| m,) = P(eg) = P(e,9), so maximum-likelihood detec- 
tion of a binary signal yields the average error probability 














— —— h 
—y 0] 
By 
aa ee 
AE. 
la) (b) 
Figure 16.5-7 fa] Signal-plus-noise vector; (b) noise vector projection obtained by transla- 


tion and/or rotation of signal points. 
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P, = o( = 2") p] 
VN 

A more familiar expression emerges when we note that |s, — sol? = 
E, — 2(s,, 59) + Eg = 2[E, — (5), 59) |, where E, = (E, + Ej)/2 is the average 
energy per bit. If the signals are polar, so so(t) = —s,(t), then (s, 59) = —E, and 
(lls, — syll/ V2NS)? = 4E,/2No = 27». Equation (11) is therefore identical to 
the result we previously obtained for polar baseband transmission with matched 
filtering. 

In the general M-ary case, Eq. (10) accounts for just one of the boundaries 
between s; and the other M — 1 signal points. A complete expression for P(e | m;) or 
P(c | mj) necessarily involves the specific geometry of the signal set, and it may or 
may not be easy to obtain. If the signal points form a rectangular array, then trans- 
lation and rotation yields rectangular decision regions like Fig. 16.5—8 where we see 
that correct detection of m; requires ~a; < f, < b, and —a, < B; < by. Since 
these orthogonal noise coordinates are statistically independent, we have 


b; 


bi 
P(c | mj) = | Pg(81) as. P&(B2) dB; [12a] 


74 m 


Each integral in Eq. [12a] can be expanded in the form 


i pa(B) dB = 1 — (=) = (=) [12b] 


obtained from the graphical interpretation of the Q function for a gaussian PDF with 
zero mean and variance N,/2. 








Figure 16.5-8 Noise vector coordinates relative to rectangular decision regions. 
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When the geometry of the signal set makes exact analysis difficult, we may set- 
tle instead for an upper bound on the average error probability. To develop an upper 
bound, first consider the case of M = 3; P(e | m,) then equals the probability of the 
union of the error events e;; and e3;, so we write P(e|m,) = P(ej, + e) = 
P(e3) + P(e3,;) — Plen, 631) = P(e) + P(ex;). Extrapolating to arbitrary M and 
m; gives the union bound 





M 
Pe|m)s Dd Ple) i*j 
i=] 
with P(e) as in Eq. (10). There are M — 1 terms in this sum, and a simpler but 
looser bound is P(e | mj) = (M — 1)9(2)/ VIM), where d, stands for the “dis- 
tance" between s; and its nearest neighbor, i.e., 


d, = minlls; — sjl [13g] 
ij 


Then, using Eq. (9a), we have 


eH o : ) [135] 
j M j71 V 2No 


pai US cs stp Suida t c c e c ER rd 
















Which is our final result. 


V aah iN ce gi ya 
ERG E dc Bra ASTU 2a 
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m 
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Consider the signal set back in Fig. 16.5-6b. The decision regions have a rather 
complicated pattern, but the nearest-neighbor distance is d; — 2a — VE, for all 
eight points. We also observe that |s|? = E, = (2a)? for even i, while ||s;]? = 
E, = 2E, for odd i. The average energy per symbol is then 


Vigne sh eT 
fs WS AEN DRE I Sr ES 


E- C X (2a)? + 4 X 22ay] = 62 


7 2a E 
P,S 8 = 7 = 
8 of V E) of Z) 


which is an upper bound on the error probability in terms of the average signal energy. 

Closer examination of Fig. 16.5-6b reveals that the corner points have rectan- 
gular boundaries, equivalent to Fig. 16.5—8 with a, = a; = a and b, = b, = co. 
Therefore, for even values of j, Eq. (12) gives the exact result 


P(c| m) *(1—- a? q = Q(a/ VNy2) = o( V E/3N;) 





Hence, from Eq. (13), 
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Furthermore, for odd values of j, we can omit the triangular area and write 


(elm) > | 


co 


paP.)dB, | pf) dB, = (1 = a)(1 = 22) 


Thus, substitution 1n Eq. (95) yields 


1 5 3 
>> os 2 + iz zz — iu en m 2 
P, 8 [4(1 q) + 4(1- q)1 2q) | pue E ) 74 


which is a more accurate result than the union bound. 





Let aset of M = 6 signals be defined by s, = —s, = aĝi, S3 = —54 =a; + 2abo, EXERCISE 16.5-1 


and s, = —sg = ad, — 2a. Construct the decision boundaries and use Eq. (12) 


to show that P, = (7q — 4q2)/3, where q = O(-V6E/IIN,) with E being the aver- 
age signal energy. 





Signal Selection and Orthogonal Signaling * 


Having learned how to implement optimum detection given a set of signals, we 
come at last to the important design task of signal selection for digital communica- 
tion. We continue to assume equiprobable symbols and gaussian noise, but we now 
add a constraint on the average signal energy E. In this context, we say that: 





Our vector interpretation suggests that the corresponding signal points should be 
arranged spherically around the origin to minimize vector length and signal energy, 
with the largest possible spacing between points to maximize separation and mini- 
mize error probability. These optimal properties define the so called simplex set in 
a subspace with K = M — 1 dimensions. 

The simplex signal points form the vertices of a K-dimensional pyramid whose 
center of gravity coincides with the origin. The pyramid reduces to an equilateral 
triangle when M = 3, and to antipodal points (s; = —5,) when M = 2. When M is 
large, there's little difference between the optimal simplex set and a set of egual- 
energy orthogonal signals. Since orthogonal signals are easier to generate and ana- 
lyze than simplex signals, we'll focus attention on the nearly optimum case of digi- 
tal communication via M-ary orthogonal signals. 
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Specifically, let an M-dimensional subspace be spanned by a set of orthonormal 
basis functions, and let 


s(t) = VEÐ | i-12,..,M [14a] 
so that 
E j-i 
(s 5j) = n pei [145] 
di-]|s-sl?-2kb  jsi [14d 


These relations define a set of mutually orthogonal signals with average signal 
energy È, as illustrated in Fig. 16.5-9 for M = 3. We'll also impose the timelimited 
condition that s(t) = 0 fort < 0 and t > D. The values of D and M are related by 


_ log, M 


Fp D 


where r; is the equivalent bit rate of the information source. 

There are many possible sets of timelimited orthogonal signals. At baseband, 
for instance, the signals could be nonoverlapping pulses of duration 7 = D/M—i.e., 
digital pulse-position modulation (PPM). Or, for bandpass transmission, orthogonal 
signaling may take the form of frequency-shift keying (FSK) with frequency spac- 
ing 1/2D. Regardless of the particular implementation, M-ary orthogonal signaling 
is a wideband method requiring transmission bandwidth 


Mr, 


Bor up eee d 
T= 23D 2lgM ^ 


[15] 


We'll see that this method trades increased bandwidth for decreased error probabil- 
ity, similar to wideband noise-reduction in analog communication. 





Figure 16.5-9 Orthogonal signal set with M = 3. 
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First note from symmetry that P(c | mj) is the same for all M signals, so 
P, = P(c\m,) = P(z; > z, alli # j) 


where z; is the decision function in Eq. (7). Since the bias term c; is independent of i, we 
can drop it and write z; = (y, sj) = (v, s). When m = mpv =s,+n= VEg, +n 
and 


"m E _ [E+ VEB i=j 
Zi 7 (VE Q; tn, VE ¢;) = El; $;) z3 VE(n, ;) m NA i*j 


in which we've introduced the noise coordinates B; = (n, ¢;). 


Next, consider some particular value of B; so, for any i # j, the probability of 
the event z; > z; given D; is 


P(z; > z|Bj) = P(E + VEB; > VEB) = P(B; < VE + Bj) 


VE+B; 
| p(B.) dB; 


—co 


Then, since the noise coordinates are independent RVs and there are M — 1 coordi- 
nates with i # j, the probability of correct detection given B; is 


Lp 


P(c | B) = | Dpg(B;) 5 ] 


—co 


Averaging P(c | B;) over all possible values of £; finally yields 


oo co VEJN +À , M-1 : 
r= | P(c | Bj)ps(Bj) dB, = v "^ | | e"du| e" daii] 


—co —coL *—oo 


where u = B;/ V No and A = Bj/ V No. 
The formidable expression in Eq. (16) has been evaluated numerically by 


Viterbi (1966), and plots of P, = 1 — P, are shown in Fig. 16.5—10 for selected val- 
ues of M. These curves are plotted versus 


E = 8D — = S [17] 
No log; M No log, M Nor, 

where S = E/D is the average signal power. We see that when S/Nor, has a fixed 
value, the error probability can be made as small as desired by increasing M. In fact, 
Viterbi proves analytically that 


lim P, = 


M-—co 


n S/ Nor, > n2 
1 S/Nor, < In 2 
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Figure 16.5-10 Error probability for M-ary orthogonal signaling. 


represented by the dashed line in Fig. 16.5-10. Hence, if M — oo, then orthogonal 
signaling with optimum detection approaches errorless transmission at any bit rate 


S 
SS Es 
No In 2 


yy < 


where C,, is the maximum capacity of an AWGN channel as predicted from infor- 
mation theory. 

But keep in mind that increasing M means increasing both the bandwidth and 
the receiver complexity, which is proportional to M. Furthermore, Fig. 16.5-10 
reveals that orthogonal signaling with fixed M >> 1 has a threshold effect in the 
sense that P, rises abruptly if S/Nor, should happen to decrease somewhat. These 
observations point to the important conclusion that reliable, efficient, and practical 
digital communication usually involves some form of error-control coding as well 
as careful signal selection and optimum receiver design. 
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16.6 PROBLEMS 


Suppose that equal numbers of the letter grades A, B, C, D, and F are given in a cer- 
tain course. How much information in bits have you received when the instructor 
tells you that your grade is not F? How much more information do you need to 
determine your grade? 


A card is drawn at random from an ordinary deck of 52 playing cards. (a) Find 
the information in bits that you receive when you're told that the card is a heart; a 
face card; a heart face card. (b) If you're told that the card is a red face card, then 
how much more information do you need to identify the specific card? 


Calculate the amount of information needed to open a lock whose combination con- 
sists of three integers, each ranging from 00 to 99. 


Calculate H(X) for a discrete memoryless source having six symbols with probabil- 
ities 


Py= 1/2 Pp=1/4 Pe-1/8 Pp=Pe=1/20 Pp= 1/40 


Then find the amount of information contained in the messages ABABBA and 
FDDFDF and compare with the expected amount of information in a six-symbol 
message. 
Do Prob. 16.1-4 with 

P,=04 Pg =0.2 Pe =012 Pp =Pe=01 Pr = 0.08 


A certain source has eight symbols and emits data in blocks of three symbols at the 
rate of 1000 blocks per second. The first symbol in each block is always the same, 
for synchronization purposes; the remaining two places are filled by any of the eight 
symbols with equal probability. Find the source information rate. 


A certain data source has 16 equiprobable symbols, each 1 ms long. The symbols 


are produce in blocks of 15, separated by 5-ms spaces. Find the source information 
rate. 


Calculate the information rate of a telegraph source having two symbols: dot and 


dash. The dot duration is 0.2 sec. The dash is twice as long as the dot and half as 
probable. 


Consider a source with M = 3. Find H(X) as a function of p when P, = 1/3 and 
P, = p. Also evaluate H(X) when p = 0 and p = 2/3. 


Consider a source with M > 2. One symbol has probability a << 1/M, and the 
remaining symbols are equally likely. Show that H(X) ~ log (M — 1) + alog (1/a). 
Obtain the bounds on K in Eq. (13) (p. 706) when N; satisfies Eq. (15). l 


Obtain the Shannon-Fano code for the source in Prob. 16.1—4, and calculate the effi- 
ciency. 


Show that there are two possible Shannon-Fano codes for the source whose symbol 
probabilities are given in Prob. 16.1—5. Then calculate the efficiency of the better 
code. 
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TEEARU hl 


16.1-14 Consider a source with M = 3 and symbol probabilities 0.5, 0.4, and 0.1. 
(a) Obtain the Shannon-Fano code and calculate its efficiency. (b) Repeat part a 
for the second-extension code, grouping the symbols in blocks of two. 


16.1-15 A binary source has symbol probabilities 0.8 and 0.2. (a) Group the symbols in 
E blocks of two, obtain the corresponding second-extension Shannon-Fano code, and 
calculate its efficiency. (b) Repeat part a with blocks of three for a third-extension 

code. 


16.1-16* A binary source has Py = P, = 1/2 and first-order memory such that P(0|1) = j 
P(1 |0) = 3/4. Calculate the resulting conditional entropy. 


16.2-1 The joint entropy of a discrete system is defined as 


1 
H(X, Y) = >, P(xiy;) log 57 
> ^ Puy) 
Show that H(X,Y) = H(Y) + H(X|Y). 
16.2-2 Expand Eq. (2) (p. 715) to obtain 1(X; Y) = H(X) + H(Y) — H(X, Y), where 
H(X, Y) is the joint entropy defined in Prob. 16.2-1. 


16.2-3 Consider a noiseless system with u source symbols, u destination symbols, and for- 
ward transition probabilities 


ofi que 
POs =) = to j*i 
Show that H(Y) = H(X), H(Y|X) = 0, and therefore I(X; Y) = H(X). Hint: Write 
the sums with indices 0 - i - y and 0 x j x y. 
16.2-4 Figure P16.2-4 represents a nonsymmetric binary channel. Follow the method used 
in Example 16.2—1 (p. 716) to obtain 


AX; Y) = Q(B + (1 — a — B)p] - pO(a) ~ (1 ~ pXXB) 


l-a 
P(x)-p Xi M CN EEEREIDMD MEE E » 
Supr w B 
"m Ua 
P@)=1-p ey 


1-8 
Figure P16.2-4 








16.2-5 The channel in Prob. 16.2—4 is said to be useless if B = 1 — a. Justify this name 
from intuitive and analytical arguments. 
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When a and b are constants, show that 


d ba 
—Q(a + bp) sph E 
dp a + bp 


Then apply this relation to confirm that I(X; Y) in Eq. (8) (p. 717) is maximum when 
p = 1/2. 
The channel in Prob. 16.2—4 is called a Z channel when 8 = 0. Use the relation in 


Prob. 16.2-6 to show that if a = 1/2, then I(X; Y) is maximum when p = 2/5. 
Then evaluate C;. 


Let the channel in Prob. 16.2-4 have a = 1/4 and B = 0. Use the relation in 
Prob. 16.2-6 to find C,. 


The binary erasure channel has two source symbols, 0 and 1, and three destination 
symbols, 0, 1, and E, where E denotes a detected but uncorrectable error. The for- 
ward transition possibilities are 


P(0]O -1-«a P(E|0)=a Pül|0)-0 

P(0|1) =0 P(E1)-o Pü|l)-21-a 
It follows from symmetry that /(X; Y) is maximum when the source symbols are 
equiprobable. Find C, in terms of a. 
Derive the bound on P(n = d) given in Eq. (145) (p. 721). 


E 1 
Find H(X) in terms of S = x? and confirm that H(X) < 5 log 27reS when p(x) is a 
uniform PDF over |x| <a. 


1 
Find H(X) in terms of S = x? and confirm that H(X) < = 5 log 2areS when p(x) is a 
Laplace PDF in Example 8.3-1. 


Find H(X) in terms of S =x? and confirm that H(X) < 5 to 2meS when 
p(x) = ae "u(x). 
Express H(Z) in terms of H(X) when Z = aX tb 


Find p(x) that maximizes H(X) and determine the resulting H(X), given the signal 
constraints x(t) = O and x(t) = m. 


Find p(x) that maximizes H(X) and determine the resulting H(X), given the signal 
constraints x(t) = 0 and x *(t) = S. 


Write p,(x|y)/p.(x) = [px)py(y)/po(™ y) ] ! and use In v = v — 1 to show that 
I(X; Y) = 0. 

Consider an AWGN channel with S/N; = 10*. Find the maximum rate for reliable 
information transmission when B = 1 kHz, 10 kHz, and 100 kHz. 


Consider an AWGN channel with B = 3 kHz. Find the minimum value of S /N in dB 
for reliable information transmission at R = 2400, 4800, and 9600 bits/sec. 
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CHAPTER 16 e Information and Detection Theory 


Consider an AWGN channel with B = 1 kHz and No = 1 wW/Hz. Find the mini- 


mum value of S in milliwatts for reliable information transmission at R = 100, 
1000, and 10,000 bits/sec. 


An engineer claims to have designed an analog communication system that has 
(S/N)p = 60 dB when Sp/N By = 5 dB and B; = 10W. Do you believe this claim? 


An ideal system has (S/N)p = 40 dB when B; = 4W. Find (S/N)p when B; is 
tripled while all other parameters are held fixed. 


Do Prob. 16.3-12 with B; reduced to W/2 instead of being tripled. 


A communication system has By = 12 kHz, L = 60 dB, and Nọ = pW/Hz. Find 
the minimum value of Sy needed to get (S/N), = 30 dB when W = 3 kHz. 


Do Prob. 16.3-14 with W = 6 kHz. 
Consider the signals v(t), w(t), and z(t) = v(t) + w(t) with energy E, = E, = 4 


and E, = 8. Construct a vector diagram based on this information, and use it to find 
E, when x(t) = 3v(t) — 0.5w(t). 


Do Prob. 16.4—1 with E, = 4, E, = 12, and E, = 16. 

Prove the triangle inequality in Eq. (2) (p. 736) from Eqs. (5) and (6). 

Derive Eq. (7) (p. 737) and show that |v,,|| = ||v|. 

Apply the Gram-Schmidt procedure to obtain an orthonormal basis for the signals 

s(t) =1 sft) =t s(t) = 2 

defined over —1 = t = 1. Then write the signal vectors in terms of the basis func- 

tions. 

Apply the Gram-Schmidt procedure to obtain an orthonormal basis for the signals 
s(t) = 1 s(t) = cos mt/2 s(t) = sin wt 

defined over —1 = ¢ = 1. Then write the signal vectors in terms of the basis func- 

tions. 


Consider the following three sets of binary signals defined in terms of orthonormal 
basis functions $, and ¢: 


(a) s, = ag, Sz = aQ 
(b) s, = ad, s = ap 
(c) s= ad, s;—0 


Construct the maximum-likelihood decision regions and write the decision func- 


tions for each signal set. Then simplify and diagram optimum receivers using corre- 
lation detection. 
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Consider the following two sets of quaternary signals (M = 4) defined in terms of 
orthonormal basis functions $, and $;: 

(a) s = ag, s,— ad, 53 = —adg, $, = ~ag 

(b) sı = ad, s =a, s, = ald, t dj) 54 = 0 

Construct the maximum-likelihood decision regions and write the decision func- 


tions for each signal set. Then simplify and diagram optimum receivers using corre- 
lation detection. 


lí 


li 


Consider the following set of quaternary signals (M = 4) defined in terms of ortho- 
normal basis functions $; and $»: 


5, = —$5 = a(Qi + $3) s2 = —s, = alı — $5) 


Construct the maximum-likelihood decision regions and find the error probability 
with AWGN, expressing your result in terms of the average signal energy E. 


Suppose the signal s, = 0 is added to the set in Prob. 16.5-3. Construct the maximum- 
likelihood decision regions and obtain an upper bound on the error probability with 
AWGN, expressing your result in terms of the average signal energy E. 


Suppose the signal sy = 0 is added to the set in Fig. 16.5—6a (p. 747). Construct the 
maximum-likelihood decision regions and find the error probability with AWGN, 
expressing your result in terms of the average signal energy E. Compare with the 
upper bound obtained from Eq. (13) (p. 750). 
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APPENDIX 761 


1 loss usually results in a very feeble signal at the input of a communication receiver. Consequently, to 
obtain an adequate signal level for further processing, the "front end" of a typical receiver includes several stages 
of amplification. But high-gain amplifiers amplify any noise that accompanies the received signal, and they also add 
their own internally generated noise. An accurate assessment of system performance must therefore take into account 
amplifier noise. 

This appendix starts with circuit and device noise, as preparation for the description of amplifiers and other noisy 
Wworport networks. Then we analyze noisy wo-ports connected in cascade, a configuration that pertains to repeater 


systems as well as to receivers. Our analysis brings out the critical factors in receiver noise and the design principles 
for low-noise systems. 


OBJECTIVES 








After studying this appendix and working the exercises, you should be able to do each of the following: 


1. Calculate the mean square noise voltage or current and the available power density for a passive network with 
internal resistance noise. 

2. Use the concepts of available power gain, effective noise temperature, and noise figure to analyze the performance 
of a noisy two-port network. 

3. Calculate the overall noise figure and noise temperature for a cascade system, and use them to find (S/N), given 
(S/N). 


Circuit and Device Noise 


Noise generated within an electrical circuit may come from numerous sources 
involving several different physical phenomena. Here we'll describe major types of 
device noise found in communication systems, and we'll develop appropriate circuit 
models and analysis methods. 

The assumption of stationarity is reasonable for most electrical noise processes, 
thereby allowing us to represent noise sources in the frequency domain. For con- 
venience, we'll adopt the common practice of working entirely with positive fre- 
quency and one-sided frequency functions—as distinguished from the two-sided 
functions used in Sect. 9.3. To clarify this distinction, let G(f) be the two-sided 
available power spectrum of some noise source. Since G(f ) has even symmetry, the 
corresponding one-sided available power density will be defined by 


nlf) = 2G(f) f= 0 [al 


and the total available noise power is 


N= | v df 116] 
0 


Likewise, the one-sided mean square voltage or current densities equal twice the 
two-sided frequency functions for f = 0. 

Figure A-1a shows the frequency-domain Thévenin circuit model of a station- 
ary but otherwise arbitrary noise source with noiseless internal impedance Z(/). 
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vA f) 
la] (b) 
Figure A-1 Frequency-domain models of a noise source. (a) Thévenin circuit; (b} Norton circuit. 


The function vY(f) represents the open-circuit mean square voltage density, 
defined such that 


ues o) [2] 


Re [Z(f)] 
Alternatively, an ideal voltmeter with bandwidth df centered at some frequency f 
would measure the rms value V vA(f ) df across the open terminals of this source. 


Converting Fig. A-1a to a Norton equivalent circuit gives the circuit model in 
Fig. A-1b, with 
itf 
ay) = CD. s 
1Z0)| 
which represents tbe short-circuit mean square current density. Equation (3) also 
expresses Ohm's law in the form needed for circuit noise analysis. 
Suppose the source in question happens to be a thermal resistance R at tem- 


perature J, so G(f) = kT/2 from Eq. (5), Sect. 9.3. Then 
nf) = kT [4a] 
and setting Z(f) = R yields 
UXf) =4RkKT  iXf) =4kT/R [4b] 
These constant densities correspond to white noise, at least up to infrared frequen- 
cies. For computational purposes, the quantity kJ can be rewritten as 
kT = KIAT /9,) ~ 4 x 107 (9/9;,) 


where we've inserted the standard room temperature Fy = 290 K. 

Resistance noise occurs in almost all circuits, but reactive elements may alter 
the frequency density. In particular, let Fig. A~2a be a one-port (two-terminal) network 
containing only resistance, capacitance, and inductance, and having the equivalent 
impedance Z(f) = R(f) + jX(f ). When the resistances are in thermal equilibrium 
at temperature J, Nyquist's formula states that 


v2(f) = 4 R(f )kT | [5] 
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Figure A-2 


Hence, the mean square voltage density takes the shape of the equivalent resistance 
R(f). Equation (5) includes the special case of an all-resistive network whose 
equivalent resistance will be independent of f. 

We prove Nyquist's formula by considering the complete circuit in Fig. A-2b, 
where the source has been connected to a load resistance R, at the same tempera- 
ture J. This load resistance, of course, generates thermal noise with v2(f) = 
4 R;,kSj. Since the two noise sources are physically independent, superposition 


applies and we can calculate the mean square current densities in each direction, 
namely, 


vif) P ART 
"Lex mura UE Gor ar ale 
IZ) + Ri] IZ) + Ri 
The average power delivered from the source to the load in any frequency band df 
is R;Z (f) df, while the load delivers R(f )i2(f) df back to the source. But the net 


power transfer must be zero at every frequency for the circuit to be in thermal equi- 
librium. Therefore, 


il) = 


Rwf) df _ RP) RAT df 
ZA) + m^ (ZA) + RP? 


and Eq. (5) follows after cancellation. 

Nyquist’s formula does not hold when the resistances are at different tempera- 
tures or the network contains nonthermal sources. However, such cases are easily 
analyzed provided that the sources are independent. You simply use superposition 
and sum the mean square values to find the resulting frequency density. 

The most common type of nonthermal noise in most electrical circuits is shot 
noise. This phenomenon occurs whenever charged particles cross a potential barrier— 
as in semiconductor junctions or vacuum tubes. Small variations of kinetic energy 
among the individual particles cause random fluctuations of the total current, rough- 
ly analogous to the sound of a stream of buckshot falling on a drum. Schottky first 
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Figure A-3 (a) Semiconductor junction diode; (b) noise model. 


studied this effect in a vacuum-tube diode operated under temperature-limited con- 
ditions. He found that shot noise could be represented by a current source with 


P(f) = 2gI [6] 


in which q is the electronic charge (1.6 X 10^? coulombs) and 7 is the dc current. 
Since Eq. (6) is independent of both frequency and temperature, shot noise is non- 
thermal white noise. Subsequent investigations have shown that i2(f) actually 
decreases at frequencies above f ^ 1/7, where r denotes the average particle tran- 
sit time. 

Schottky’s result also holds for the semiconductor junction diode in Fig. A—3a. 
The net dc current consists of two components, given by the diode equation 


peque [7al 


where V is the junction voltage and J, is the reverse saturation current. The two cur- 
rent components produce statistically independent shot noise, so the total mean 
square noise current density becomes 


(f) = 2gI,e*"9 + 2a], = 2q(I + 21) [75] 


Figure A-3b shows the complete noise source model, including the diode's dynamic 
resistance 


1 ky . 


Ue TIVA ql 


[Zc] 


Unlike ohmic resistance, dynamic resistance is noiseless since it does not corre- 
spond to any power dissipation. 

Now consider the junction field-effect transistor (JFET) in Fig. À-4a, which 
has a reverse-biased junction between the gate terminal G and the semiconductor 
channel from the source terminal S to the drain terminal D. Figure A-4b shows a 
simplified noise equivalent circuit with current sources representing the gate shot 
noise and the channel terminal noise, given by 


Wf) = 2ql, if) = 4k9 X $8, 


where g,, is the transconductance. When external circuitry connects to the gate and 
source terminals, the shot noise is amplified via the controlled current generator 
£,,V,, and adds to the output noise at the drain terminal. 
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Figure A-4 (a) Junction field-effect transistor; {b} noise model. 


An FET with an insulated gate structure avoids junction shot noise. But a bipo- 
lar junction transistor (BJT) has two semiconductor junctions and two sources of 
shot noise in addition to thermal noise from internal ohmic resistance. Besides ther- 
mal and shot noise, all transistors generate certain types of nonwhite noise that may 
or may not be significant, depending upon the particular device and the application. 

Transistors, vacuum tubes, and other devices exhibit a low-frequency phenom- 
enon known as flicker noise—often called “one-over-f” noise because the mean 
square density is proportional to 1/f” with v ^ 1. Some semiconductor devices also 
produce burst or *popcorn" noise, whose waveform resembles the random tele- 
graph wave in Fig. 9.2—3a, (p. 366). Hence, the mean square density eventually falls 
off as 1/f^. Figure A—5 illustrates the frequency variation that results when i2(f) 


m ey, 






Burst noise 
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Figure A-5 Frequency variation of semiconductor noise with flicker and burst noise. 
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consists of flicker, burst, and white noise. Flicker and burst noise pose serious prob- 
lems for low-frequency applications, but they usually can be ignored at frequencies 
above a few kilohertz. At much higher frequencies, capacitive coupling and various 
other effects tend to increase noise in electronic devices. 





Vacuum diode noise generator 
Noise generators are used in the lab to study noise characteristics, and Fig. A-6a 
gives the schematic diagram of a noise generator employing a temperature-limited 
vacuum diode. The choke coil and blocking capacitor serve to separate the dc and 
and shot-noise currents, as indicated. 

The corresponding noise circuit model in Fig. A-6b includes the diode's noise- 
less dynamic resistance r and the noisy thermal resistance R, presumed to be at room 
temperature J. The equivalent impedance is the parallel combination 


_ R(r + 1/joC) 
Aa) R + (r + 1/joC) 


but the nonthermal shot noise precludes the use of Nyquist’s formula. Instead, we 
convert the shot-noise current and dynamic resistance into Thévenin form shown in 


Choke coil 





r 41/joC 


vi( f) = r2gl 





Figure A-6 Circuit diagram and noise models of a vacuum diode noise generator. 
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Fig. A-6c. Now, applying superposition, the total short-circuit mean square current 
density 1s clearly seen to be 
r?2gI 4k9g 


2 RTT ee 
int?) |r + l/jeC|? R 





and therefore . 
l R?(r°2ġI) R?|r + 1/joC|? 4X9, 

vf) = |ZU) PRE) = ———— t n — 
|R +r + 1/joC| IR trc d/joC|^ R 


Note that the first term is the open-circuit mean square voltage density produced by 
vi(f ) when Ż(f) = 0, while the second term is produced by (f) when vi(f) = 0. 

In normal operation of the noise generator, we're not concerned with the noise 
at low frequencies (which would probably be dominated by flicker noise), and the 
dynamic resistance r is much greater than R. Under these conditions, our results 
simplify to 

Z(f) =R vXf) = 2Rgl + 4 Rk9, 
over the frequency range 
1/2mrrC < f< lc 


Tubes designed for this application have very short transit times, typically r = 1 ns, 
so the shot noise remains constant up to about 1 GHz. Finally, from Eq. (2), the 
available noise power density is 
n(f) ^ $qIR + kTy = KT, + To) 
where 
9. = gIR/2k 


which we interpret as the equivalent noise temperature of the shot noise. The volt- 
age applied to the filament of the tube controls J and thus controls 2,, whose value 
can be determined from an ammeter reading. 





EXERCISE A-1 


Consider the network in Fig. A-7, where R, and R, are thermal resistances at differ- 
ent temperatures. Let R, = R, = R and L = R/27. Obtain expressions for vA( f) 
and n( f). Then simplify your results when J, = J, = F. 








Figure A-7 
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Amplifier Noise 


A detailed circuit model showing all the individual noise sources within an amplifi- 
er would be very complicated and of little practical value to a communications engi- 
neer. Consequently, alternative methods have been devised for the analysis of noise 
in amplifiers. Two particularly useful measures of amplifier noise are the effective 
noise temperature and the noise figure. Both of these measures involve the concept 
of available power gain, which we introduce first. 

Let Fig. A-8 be the circuit model of a noiseless amplifier inserted between a 
source and a load. (For simplicity, we've omitted any reactances that might be asso- 
ciated with the source, amplifier, or load impedances.) The amplifier itself is char- 
acterized by an input resistance r;, output resistance r,, and voltage transfer function 
H(f). The source generates a mean square voltage density v2(f )—Trepresenting 
noise or an information signal or both—and the available power density from the 


source is mf) = v2(f£)/4 R,. The available power density at the output of the 
amplifier is 





2 252 z r, 
nlf) = EO -EO A) anI y 


4r, 4r, ~ 4r, R, +7; 


o 


vf) 


We define the amplifier’s available power gain g,( f) as the ratio of these available 
power densities, Le., 


ad f) 4 nf) = vf )R, ( |H(f) | 3E [8] 


"(f) vXfy, NR, *n 


The actual power gain of an amplifier equals the available power gain when the 
impedances are matched to obtain maximum power transfer at input and output. 

We'll assume hereinafter that the source generates white noise, thermal or non- 
thermal, with noise temperature J,. Then nf) = kJ, and the available noise 
power at the output of a noiseless amplifier would be 


"Xf ) = gf INSS) = e KT, 


But a noisy amplifier contributes additional internally generated noise. Since the 
internal noise is independent of the source noise, we write 


nf) = Bal f RT, + Tli f ) 19] 


Fo 





Figure A-8 Circuit model of a noiseless amplifier. 
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mf) = ko, 


Nin S) 
{a) 
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——— (à By No 
k3, 
(b) 


Figure A~9  Blockdiagram representation of a noisy amplifier. 


where Nin( f ) stands for the available power density of the internal noise seen at the 
output. Figure A-9a depicts Eq. (9) in the form of a block diagram. Integration then 
yields the total available output noise power 


Nie | nol) g=] s) dP* [| mU) df 
0 0 0 


an expression that calls for some simplifications. 
Most amplifiers in a communication system have a frequency-selective 


response, with maximum power gain g and noise equivalent bandwidth By. These 
parameters are related to g,(f) by 


gBy — | ga f£) df [10] 
0 


so the first term of N, reduces to KJ ,gBy. Next, to simplify the second term, we 
define the effective noise temperature of the amplifier to be 


= 


1 oo 
J, = —_ [11] 
e i d 
a Taf ) if 
Hence, the total output noise power becomes 
N, = K9,gBy + gkByT, = gk(9, + T.)By [12] 


diagrammatically portrayed by Fig. A-9b. This diagram brings out the fact that 9, 
represents the internal noise referred to the input and thereby expedites calculations 
of signal-to-noise ratios. 

Now let Fig. A-10 represent a noisy amplifier with signal plus white noise at 
the input. The available signal power from the source is S,, and the signal spectrum 
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nsf) = ko, 


: ed ies 
Je 
(S/N), 


Figure A-10 — Noisy amplifier with input signal and noise. 


falls within the passband of the amplifier, so the available signal power at the out- 
put will be S, = gS,. Thus, using Eq. (12), the output signal-to-noise ratio is 


(=) E E eos [13] 
N/, N, Kk(9,*9,)By 


where the gain g has cancelled out in numerator and denominator. Although the 
source noise does not necessarily have a defined bandwidth, the source signal-to- 
noise ratio 1s taken by convention to be 


EE 
N/; k9g,By 
which just corresponds to the signal-to-noise ratio produced by an ideal noiseless 


filter with unit gain and bandwidth By. However, from Eq. (13), the noisy amplifier 
produces 





(S). raan res) O 
NJs Bog 9 KU T/A RNS 
and we see that (S/N), < (S/N),. 

We also see that the degradation of signal-to-noise ratio due to a noisy amplifier 
depends on the value of effective noise temperature relative to the source noise tem- 
perature. In particular, if J, << F, then (S/N), = (S/N),; under this condition the 
internal noise has little effect, and the amplifier appears to be noiseless. This condi- 
tion holds primarily at carrier frequencies below about 30 MHz, whereas J, 
becomes significant at higher frequencies. Such considerations often affect the 
design of receivers and repeaters. 

When impedances are not matched at the input or output all signal and noise 
powers will be less than the available powers. Nonetheless, Eqs. (13) and (14) 
are still valid because they express power ratios measured at specific points, so 
the impedance mismatch factor cancels out along with the gain. The effective 
noise temperature is therefore a significant parameter, irrespective of impedance 
matching. 

Another measure of amplifier noise is the noise figure F. (This 1s also called 
the noise factor and symbolized by np or NF. Some authors reserve the term “‘noise 
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figure" for the value of the noise factor expressed in dB.) The noise figure is defined 


such that 
S 1/85 
(3). d (3) when J, = J [15] 
Since S, = gS, and (S/N), = S,/kJ By when J, = Fo, 
N, 3, 





= = 1 [16] 
gkT oBy To 


and conversely 
J, =(F-1) [17] 


A very noisy amplifier has J, >> J, and F >> 1—in which case we usually 
express the value of F in decibels. Then, if the source noise is at room temperature, 
Eq. (15) says that (S/N), in dB equals (S/N), in dB minus F in dB. A low-noise 
amplifier has J, < Jy and 1 < F < 2—in which case we usually work with effec- 
tive noise temperature and calculate (S/N), from Eq. (13). 

Table A—1 lists typical values of effective noise temperature, noise figure, and 
maximum power gain for various types of high-frequency amplifiers. Many low- 
noise amplifiers have cryogenic cooling systems to reduce the physical temperature 
and thus reduce internal thermal noise. Other amplifiers operate at room tempera- 
ture, but the internal noise comes from nonthermal sources that may result in 
J, > To- 

Equation (16) defines the average or integrated noise figure in the sense that N, 
involves the integral of ņn,( f) over all frequency. But sometimes we need to know 


Table A-1 Noise parameters of typical amplifiers 


Type Frequency 3,,K F,dB g, dB 
Maser 9 GHz 4 0.06 20-30 
Parametric Amplifier 
Room temperature 9 GHz 130 1.61 10-20 
Cooled with liquid N; 6 GHz 50 0.69 
Cooled with liquid He 4 GHz 9 0.13 
FET Amplifier 
GaAs 9 GHz 330 3.3 6 
6 GHz 170 2.0 10 
1 GHz 110 1.4 12 
Silicon 400 MHz 420 3.9 13 
100 MHz 226 2.5 26 
Integrated Circuit 10.0 MHz 1160 7.0 50 


4.5 MHz 1860 8.7 5 
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Figure A-11 Two-generator circuit model of amplifier noise. 


how the internal noise varies with frequency. The spot noise figure F(f) contains 
this information in the form 


F(f) 4 mi when 9,- 9, [18] 


The value of F( f ) at a particular frequency serves as an estimate of the noise figure 
when a wideband amplifier is used for narrowband amplification with the help of 
additional filtering. 

Finally, to relate system and circuit models of amplifier noise, we should men- 
tion the two-generator model diagrammed in Fig. A-11. This circuit model repre- 
sents the internal noise in terms of fictitious voltage and current sources at the input 
terminals of an equivalent noiseless amplifier. The total mean square voltage density 
at the open-circuited output is 


H r; \? 
atr) = (DU) a rag, + tr) + BUI 


provided that v2(f ) and i2(f) are uncorrelated. Using Eqs. (8) and (18) with n,(f) = 
v2(f )/4r,, we find that 


2 2:2 
vf) + Rif) 
De ————————— [19] 
Y) 4 R kTo 
which shows that the spot noise figure depends in part upon the external source 
resistance. Hence, optimizing F(f) at a particular frequency often requires a trans- 
former to obtain the optimum source resistance. 


Measuring absolute noise power is a difficult chore, so clever techniques have been 
developed for amplifier noise measurement with a relative power meter connected 
at the output. One technique utilizes a calibrated source of white noise, such as a 
diode noise generator, impedance-matched to the input of the amplifier. The proce- 
dure goes as follows. 
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First, set the source noise temperature at J, = Jy and record the output meter 
reading N,. From Eq. (12), this value corresponds to 


N, = CN, = Cgk(9, + 9,)By 


where the proportionality constant C includes any impedance mismatch factor at the 
output. Second, increase the source temperature to J, = J, + Fy such that tbe 
meter reading has doubled and 


N = Cgk[(9, T To) + 3,]By = 2N, 
Then N;/N, = (9, + Fo + F.)/(To + 8.) = 2, so 


T,=, -h F-9J9, 


Note that we don’t need to know g, By, or the constant C. 





An amplifier with g = 60 dB and By = 2 MHz has N, = 40 nW when the source 


noise is at room temperature. (a) Find the effective noise temperature and noise 
figure. (b) Calculate the source temperature needed for the second step of the 
measurement procedure in Example A-2. 


EXERCISE A-2 
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System Noise Calculations 


Here we take up the analysis of cascade-connected systems that include amplifiers 
and other noisy two-port networks. Our objective is to develop expressions for the 
overall performance of the system in terms of the parameters of the individual 
stages. 

First, we must give consideration to lossy two-port networks such as transmis- 
sion lines and connecting cables. Power loss implies dissipation by internal resis- 
tance. Consequently, the internal noise is thermal noise at the ambient temperature 
Tamb and Nil f) = KF amp. However, we cannot use the model in Fig. A-9a because 
lossy two-ports are bilateral, meaning that a portion of the internal noise flows back 
to the input. When impedances are matched, a bilateral two-port has constant gain 
g < 1 in both directions, so &Nim(f ) flows back to the input while (1 — g)n(f) 


goes to the output. The total available noise power in bandwidth By at the output 
thus becomes 


N, = gkT By 23 (1 = g)k9 By x gk[9, + (L E 1) Damp |B iy 
where 
L= 1/g 


which represents the transmission loss or attenuation. Comparing our expression for 
N, with Eq. (12), we obtain the effective noise ternperature 


T, — (L - 1)T amb [20a] 
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Figure A-12. Cascade of two noisy two-ports. 


and Eq. (16) gives the noise figure 

F=1 + (61) al Io) [20b] 
If a lossy two-port is at room temperature, then Jambe = Jo and Eq. (20b) reduces to 
F= lL, 

Next, consider the cascade of two noisy two-ports in Fig. A-12, where sub- 
scripts identify the maximum power gain, noise bandwidth, and effective noise tem- 
perature of each stage. We reasonably assume that both stages are linear and time 
invariant (LTI). We further assume that the passband of the second stage falls with- 
in the passband of the first stage, so B, = B, and the overall noise bandwidth is 
By ~= Ba. This condition reflects the sensible strategy of designing the last stage to 


mop up any remaining noise that falls outside the signal band. The overall power 
gain then equals the product 


$ = 8182 
since the first stage amplifies everything passed by the second stage. 
The total output noise power consists of three terms: 
1. Source noise amplified by both stages; 
2. Internal noise from the first stage, amplified by the second stage; 
3. Internal noise from the second stage. 
Thus, 


9. 
N, = gk3,By + gi(gyk 9,By) + gk TaBy = a(s, i sh 
1 


and the overall effective noise temperature is 








J, 
P = Ji + d 
& 
The overall noise figure is 
Ja 35 F, ax 1 
Jo 8149 8i 


which follows from the general relationship F = 1 + 9/9. 
The foregoing analysis readily generalizes to the case of three or more cascaded 
LTI two-ports. The overall effective noise temperature is given by Friis’ formula as 
To -J 
G,=F5,+ 4+ H ee [21] 
& 8182 





APPENDIX 


and the overall noise figure is 


[E 1 F, ins l 
F-F + Tor [22] 
$1 $182 


Both expressions bring out the fact that 














On the one hand, suppose the first stage is a preamplifier with sufficiently large 
gain g, that Eq. (21) reduces to J, ~ JJ. The system noise is then determined pri- 
marily by the preamplifier. The remaining stages provide additional amplification 
and filtering, amplifying the signal and noise without appreciably changing the 
signal-to-noise ratio. The design of low-noise receivers is usually based on this pre- 
amplification principle. 

But, on the other hand, suppose the first stage happens to be a connecting cable 
or any other /ossy two-port. From the noise viewpoint, the attenuation is twice 
cursed since g; = 1/L, < 1 and J, = (L, — 1)9,,,. Equation (21) thus becomes 


ae TS sap + LJ, + L193/ 82 S ien 


which shows that Z, multiplies the noise temperatures of all subsequent stages. 

Now consider a complete communications receiver as drawn in Fig. A-13a. 
The receiver has been divided into two major parts: a predetection unit followed by 
a detector. The detector processes the amplified signal plus noise and carries out a 
nonlinear operation, i.e., analog demodulation or digital regeneration. These opera- 
tions are analyzed in Chaps. 10, 11, and 14, making the reasonable assumption that 
the detector introduces negligible noise compared to the amplified noise coming 
from the predetection unit. We're concerned here with the predetection signal-to- 
noise ratio denoted by (S/N). 

The predetection portion of a receiver is a cascade of noisy amplifiers and other 
functional blocks that act as LTI two-ports under the usual small-signal conditions. 
Hence, as indicated in Fig. A-13a, the entire predetection unit can be characterized 
by its overall effective noise temperature calculated from Eq. (21). (When the pre- 
detection unit includes a frequency converter, as in a superheterodyne receiver, its 
conversion gain takes the place of available power gain.) For a well-designed receiver, 
the predetection noise bandwidth essentially equals the transmission bandwidth By 
required for the signal. If the available signal power at the receiver input is Sg and 
the accompanying noise has temperature Jp, then Eq. (13) becomes 


(S/N) p = Sp/NoBr [23a] 
with 


No Ê (Tg + Te) = kT y [23b] 


775 


776 





EXAMPLE A-3 


APPENDIX 
nef | KZ p ice ice unit 
Lm. 
8, MAE Detector pem 
i 
: (a) 
Ny = kT y 
Sa | 8. Br (S/N)  Sa/NgBr 

(b) 

Figure A-13 [a] Communication receiver; [b] noise model of predetection unit. 


The sum Ty = Tp + 9, is called the system noise temperature, and Np represents 
the total noise power density referred to the input of an equivalent noiseless receiver— 
corresponding to the diagram in Fig. A-13b. 


Satellite ground station 

The signal received at a satellite ground station is extremely weak. Fortunately, the 
accompanying noise comes primarily from “cold” atmospheric phenomena and has 
a very low temperature. Minimizing the receiver noise is therefore both essential 
and justifiable. (In contrast, a receiving antenna nr at or Ope the horizon 
picks up blackbody radiation from the "hot" earth; then Jp ~ Jp and the receiver 
noise will be relatively less important.) 

Figure A-14 depicts an illustrative low-noise microwave receiver for a satellite 
signal with frequency modulation. The waveguide is part of the antenna feed struc- 
ture and introduces a small loss; the corresponding effective noise temperature is 
J, = (1.05 — 1)290 = 14.5 K, from Eq. (20a). Two preamplifiers are employed to 
mitigate the noise of the high-gain FM receiver. Inserting numerical values into 
Eq. (21), we get the overall effective noise temperature 





ti 


1.05 x 170 " 1.05 x 1860 


T, Scb 105XxX9+ 
ave Tees 100 100 x 10 


I 


14.5 9: + 1.8 + 2.0 = 27.8 K 


Notice that the waveguide loss accounts for half of J,, whereas the noise from the 
FM receiver has been nearly washed out by the preamplification gain. 
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i Preamplifiers 


Waveguide 







FM receiver 


S/N 
Br225MHz eas 


Sp=-90dBm L,=0.24B 8 = 20 dB g3 = 10 dB g4 = 60 dB 
= 10-2W =1.05 = 100 =10 
Tp=30K TF amb = Jo 95,29K 942170K F, =1860K 


Figure A-14 Satellite ground station. 


The system noise temperature is Jy = Tp + 9, = 57.8 K ~ 0.29,. Therefore, 
using Eq. (23), 
S 107”? 
(=) BL = 50 = 174B 
N/, 4X10! X02 X25 X 10° 


This small signal-to-noise ratio would be insufficient for analog communication, 
were it not for the further improvement afforded by FM demodulation. 





Suppose the parametric amplifier in Fig. A~14 could be mounted directly on the EXERCISE A—-3 
antenna, ahead of the waveguide. Find F y with and without the FET preamplifier. 





Cable Repeater Systems 


The adverse noise effect of lossy two-ports obviously cannot be avoided in cable 
transmission systems. However, we previously asserted that inserting repeater 
amplifiers improves performance compared to cable transmission without repeaters. 
Now we have the tools needed to analyze the noise performance of such repeater 
systems. 

A repeater system normally consists of m identical cable/repeater sections like 
Fig. A-15. The cable has loss L, and ambient temperature F me ~ Fo, so F, = Le. 
The repeater amplifier has noise figure F, and gain g, = L, to compensate for the 
cable loss. We'll treat each section as a single unit with power gain 


Ser = (ML,)g, = 1 
and noise figure 


Pera 
fuso ; = L, PLP) = LF, 
(1/L.) 
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Problems 


A-1* 


A-2 


A-3 


A-4 


A-5* 


A-6 


` Furthermore, we can reasonably presume that the noise temperature at the transmit- 
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Repeater 







Cable 


gc = UL, 
7 amp = Fo 8r= Le 


Figure A-15 One section of a cable repeater system. 


calculated from Eq. (22). The overall noise figure for m cascaded sections then 
becomes 


F = mF,, — (m — 1) (24a] 


= mF, [24b] 


The approximation in Eq. (24b) assumes F,, >> 1, consistent with L, >> 1. This 
result explains the rule-of-thumb saying that "doubling the number of repeaters 
increases the noise figure by 3 dB.” 


Most systems do have F,, >> 1, so the effective noise temperature is 


T, = (F — 1)9, = mF, 9, 


ter will be small compared to J.. Under these conditions, the transmitted signal 
power 5, yields at the destination the predetection signal-to-noise ratio 


(&) TEC PEE. NER Hz) 

N R` k3,By m(kF., SB) m\ N 1 

where (S/N), = Sz/(kF,,IoBr), which corresponds to the signal-to-noise ratio at 
the output of the first repeater. 


Obtain expressions for v2( f) and (f) when resistance R, at temperature J, is 
connected in series with R, at temperature J,. Check your result by taking 
31295,2383. 

Do Prob. A-1 for a parallel connection. 

Find u2(f) when the circuit in Fig. A-7 has J, = J, = 9, R, = 1, R, = 9, and 
L = 1/27. 

Let the inductance in Fig. A~7 be replaced by capacitance C. Find v2(f) when 
J, = J, = F and R 5 R =R. 

Let the voltage in Fig. A-3 be V >> kT/q. Write i;( f ) in terms of r and explain 
why junction shot noise is sometimes called “half-thermal noise.” 


An amplifier with r; = r, = 50 Q is connected to a room-temperature Source with 











A-10* 


A-11 


A-12 


A-13 


A-14* 


A-15 
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R, = 50 Q. The amplifier has | H(f) | = 200 TI[(f — £.)/B] for f = 0 and 
maf) = 2 X 107'°TI[(f — f,)/B], where f, = 100 MHz and B = 1 MHz. Find 
gBy, kJ ,, and N,. 

An amplifier with g = 50 dB and By = 20 kHz is found to have N, = 80 pW 
when J, = Jo. Find J, and F, and calculate N, when J, = 29. 

When the noise temperature at the input to a certain amplifier changes from Jọ to 
29, the output noise increases by one-third. Find J, and F. 

A sinusoidal oscillator may be used in place of a noise source for the measure- 
ment process in Example A-2 (p. 772). The oscillator is connected but turned off 
for the first measurement, so its internal resistance provides the source noise. The 
oscillator is then turned on, and its signal power S is adjusted to double the meter 
reading. Obtain an expression for F in terms of S, and discuss the disadvantages of 
this method. 

Impedance matching between a 300-Q antenna and a 50-6) receiver is sometimes 
accomplished by putting a 300-0 resistor in series with the antenna and a 50-0 
resistor across the receiver's input terminals. Find the noise figure of this resistive 
two-port network by calculating its power gain with a 300-6) source resistance and 
a 50-2 load resistance. 

Two cascaded amplifiers have the following properties: J; = 39, gı = 10 dB, 
F, = 13.2 dB, g, = 50 dB, B, > B; = 100 kHz. What input signal power is 
required to get (S/N), = 30 dB when J, = 109? 

A system consists of a cable whose loss is 2 dB/km followed by an amplifier with 
F — 7 dB. I£ J, = Go, then what's the maximum path length for 

(S/N), = 0.05(S/N),? 

A receiver system consists of a preamplifier with F = 3 dB and g = 20dB;a 
cable with L = 6 dB and J mp = Fo; and a receiver with F = 13 dB. (a) Calcu- 
late the system noise figure in dB. (b) Repeat part a with the preamplifier 
between the cable and the receiver. 

A receiver system consists of a waveguide with L, = 1.5 dB; a preamplifier with 
8, = 20 dB and J, = 50 K; and a receiver with F, = 10 dB. To what temperature 
must the waveguide be cooled so that the system has J, = 150 K? 

The Haus-Adler noise measure for an amplifier 1s defined as 

M = (F — 1)/(1 — 1/g). Show from noise considerations that in a cascade of 
two different amplifiers, the first amplifier should have the lower value of M. Hint: 
Write Fi; — Fa for the two possible configurations. 





Fourier Transforms 


Definitions 


Transform 


Inverse transform 


Integral theorem 


Theorems 
Operation 
Superposition 
Time delay 


Scale change 


Conjugation 
Duality 
Frequent translation 


Modulation 


Differentiation 


Integration 


Convolution 


Multiplication 


Multiplication by t" 
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Table T.1 


Wf) = F[v(2)] = | vo Raf. dt 


v = FIV] = | 


—Oo9 


Function 
ayv,(t) + aw(t) 


u(t — ta) 
v(at) 


v*(r) 
Va) 
v(t)e ^*' 
v(t) cos (wet + $) 
d"v(t) 
dt" 
| v(A) da 


—0oQ9 





U*w(t) 
v(t)w(t) 


t"v(t) 


oo 


oo 


V(f)e ?"* df 


| veo a = | vwa 


—oo 


Transform 
aVi(f) + aV{f) 
Ve 


1 f 
Tal (5) 
Veep) 
u(-f) 
Yr — fa) 
(Wf — fajet + WF + feje] 





(nf YS) 
l i 
Pa + 3 V0) Sf) 
VWF )W SF) 
V= Wf) 
d"W(f) 


(yt 


df" 
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Transforms 


Function 


Rectangular 


Triangular 
Gaussian 


Causal exponential ` 


Symmetric exponential 


Sinc 


Sinc squared 


Constant 
Phasor 
Sinusoid 


Impulse 
Sampling 
Signum 


Step 





v(t) 
n (+) 
P 
t 
«() 
g "e 
eult) 


gn 


sinc 2Wt 


sinc? 2Wt 

l 

e ierte) 

cos (wst + $) 


&(t — t4) 


> S(t — kT,) 


k=-00 


sgnt 


u(t) 


Vf) 
T sinc fr 
T sinc? fr 


(1/5) er 


ale af — f) + e*S(F + £)] 


EL 


XX MP 


A-0090 
\/jnf 
l 


pu A) 
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Table T.2 


Fourier Series 
Definitions 
If v( f) is a periodic with fundamental period 
To = lf = 2r/w 
then it can be written as the exponential Fourier series 


XN | 
vlt) = Y, c, ee 


n--oo 


The series coefficients are 


t To 
py | v(t)e "v dt 
0 


ü 
where t, is arbitrary. 
When v(t) is real, its Fourier series may be expressed in the trigonometric forms 


eo 
u(t) = c, + >) |2c,| cos (nwy t + arg c,) 
n=} 


oo 
=c,+ bi (a, cos nwo t + b, sin nog t) 


n=1 


where 
a, = 2 Re [c,] b, = —2 Imo] 
Coefficient calculations 


If a single period of v(t) has the known Fourier transform 


vom), 


1 
€, = T Ano) 





Lf) =F 





then 











TABLES 783 


The following relations may be used to obtain the exponential series coeffi- 
cients of a waveform that can be expressed in terms of another waveform v(t) 
whose coefficients are known and denoted by c,(n). 


Waveforms Coefficients 
Ac,(0) + B n=0 
Av(t) + B Ac,(n) n*0 
v(r- r) e (n)e7ns 
dv(t)/dt (J2mnfy)c,(n) 
u(t) cos (mat + $) lo(n — m)ei? + 4c,(n + me * 


Series coefficients for selected waveforms 


The waveforms in the following list are periodic and are defined for one period, 
either 0 < ż < Ty or |t| < T,/2, as indicated. This listing may be used in con- 
junction with the foregoing relationships to obtain the exponential series coeffi- 
cients for other waveforms. 





Waveform Coefficients 
Impulse train 
&(t) jt} < Ty2 1/To 
Rectangular pulse train 
TIí(t/7) |t| < Ty2 (t/Tp) sinc nfor 
Square wave (odd symmetry) 
+1 0<1t<T,/2 0 neven 
-1 —-Ty2 «t«0 —j2/mn n odd 
Triangular wave (even symmetry 
4|t| 0 n even 
EE? et (2/an)? n odd 
Sawtooth wave 
1/2 n=0 
t/ To 0ct«Ty 
j/27n n0 
Halt-rectified sine wave 
sin Wot 0ct« Ty2 WEE n even 
0 To/2 « t < Ty —jn/4 n= Ł1 
0 otherwise 


784 


TABLES 


Table T.3 


Mathematical Relations 


Certain of the mathematical relationship encountered in this text are listed below 
for convenient reference. However, this table 1s not intended as a substitute for 
more comprehensive handbooks. 

Trigonometric identities 

e} = cos 0 + j sin 0 

et + eB = 2 cos (a — B)ei@t?) 
e?" — eB = j2 sin (a — B)ei@*?) 


ae ; 
cos 0 = 26^ + e 7) = sin (0 + 90°) 


La ; 
sin = oe — e 9) = cos (8 — 90°) 
J 
sin? 0 + cos? 0 = 1 
cos? @ — sin? 6 = cos 20 
cos? @ = 4 (1 + cos 20) 
cos? @ = 1(3 cos 0 + cos 30) 
sin? 0 = 1 (1 — cos 28) 
sin? 0 = } (3 sin 0 — sin 30) 
sin (e + B) = sina cos B + cosa sin B 
cos (a + B) = cosa cos B F sine sin B 


ee S tan « + tan B 
ARE — 1 * tana tan B 
sin o sin B = } cos (a — B) — ł cos (a + B) 


cos æ cos B = 4 cos (a — B) + 1cos(a + B) 
sin a cos B = 4 sin (œ — B) + $ sin (a + B) 
A cos (0 + œ) + Bcos(0 + B) = Ccos@ — Ssin0 = Rcos(0 + $) 
where 
C= Acos a  Bcos B 
S=Asina + Bsin B 
R= VC? S = VA? + B. + 24B cos (a — B) 
A sin a + BsinB 
A cos a + B cos B 








S 
d= arctan = arctan 
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Series expansions and approximations 
n(n — 1) 
2! 


2 


(1 +x) = 1 +nx+ eet eee |nx| <1 
1 2 

e=lext—xt+ >: 
2! 


a=1+ xina + (xina) Toe 


a(l tx) =x—$x? +423 4 mas 


sinx =x ge tye T 
1 1 
cosx=1—-—x+—x- >: 
2! 4 


tanx —-x b ix E34 
arcsin x = x Fix 4 $x cc 


xix tix oe |x| <1 
arctan x = 1 1 
Ti+ x51 
2 x 3x 
sinc x = 1 = — (rx)? + (7x)? — 


x9 XY - cho Y" ato G)" - 


go x?«1 
h(x) x , 


IN 2arx x SS 1 





Summations 
ud M(M + 1) 
Sim Lice Wie ee 
m=) 2 
ep a MUM + DOM +1) 
m ae 
m=) 6 
Sa _ M*(M + 1)? 
m=) 4 
M „M+ —] 
mal m 
m=0 x—1 
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Definite integrals 


ceo 





























| diee ee n>m>0 | sitas = | cos x? dx = =,/— 
pc b qox sin (mm/n) A d 2-N3 
? sin ? tanx T a ! 
| Ea | dx = — | wesa nzla» 
x x 2 a" 
0 0 0 
T i 22 1 
— qi«l1 [eat Ve a0 
E 2 " 2a 
sin x cos ax - 
x us ug EU l 
0 4 xe dx =} Vr 
0 d 9 
? sin! x T T 
| CREE, odere | eed e 3 a0 
ge t 2 ? loa 
F ake et [em saxis - a0 
"E 2 d 1-gd 
| sinc xdx = | sinc? x dx = i | e^ ** cos bx dx = do Ve" hl 
0 0 0 ; 2a 
Schwarz’s inequality 
b 2 b b 
[vama dÀ «| joala | |w(A) |? da 
a a a 





The equality holds if v(A) = Kw(A) where K is a constant. 


Poisson's sum formula 


Ss eg TaNL =L bx 5(A n mL) 


n---oo m- --oo 





The Sinc Function 


Table T.4 


TABLES 


Numerical values of sinc x = (sin 7x)/7rx and its square are tabulated below for x 


from 0 to 3.9 in increments of 0.1. 


0.0 
0.1 
0.2 
0.3 
0.4 
0.5 
0.6 
0.7 
0.8 
0.9 
1.0 
1.1 
12 
L3 
1.4 
1.5 
1.6 
1.7 
1.8 
1.9 


sinc x 
1.000 
0.984 
0.935 
0.858 
0.757 
0.637 
0.505 
0.368 
0.234 
0.109 
0.000 
—0.089 
—0.156 
—0.198 
—0.216 
—0.212 
—0.189 
—0.151 
—0.104 
—0.052 


sinc? x 
1.000 
0.968 
0.875 
0.737 
0.573 
0.405 
0.255 
0.135 
0.055 
0.012 
0.000 
0.008 
0.024 
0.039 
0.047 
0.045 
0.036 
0.023 
0.011 
0.003 


2.0 
2.1 
2.2 
2.3 
2.4 
2.5 
2.6 
2.7 
2.8 
2.9 
3.0 
3.1 
3.2 
3.3 
34 
3.5 
3.6 
37 
3.8 
3.9 


sinc x 
0.000 
0.047 
0.085 
0.112 
0.126 
0.127 
0.116 
0.095 
0.067 
0.034 
0.000 
—0.032 
—0.058 
—0.078 
—0.089 
—0.091 
—0.084 
—0.070 
—0.049 
—0.025 


sinc? x 
0.000 
0.002 
0.007 
0.013 
0.016 
0.016 
0.014 
0.009 
0.004 
0.001 
0.000 
0.001 
0.003 
0.006 
0.008 
0.008 
0.007 
0.005 
0.002 
0.001 
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Table 1.5 


Probability Functions 


Binomial distribution 


Let the discrete RV J be the number of times an event A occurs in n independent 
trials. If P(A) — a, then 


Pi) = ("Jers — ay i70,1, ...,n 
i 


E[I] = na o; = na(1 — a) 
If n > 1,a < 1, and m = na remains finite, then 
mi 


P,(i) = e " — 
i! 


Poisson distribution 


Let the discrete RV J be the number of times an event A occurs in time T. If 
P(A) = #AT < 1 in a small interval AT, and if multiple occurrences are statisti- 
cally independent, then 


ray eg yt 


Uniform distribution 


If the continuous RV X is equally likely to be observed anywhere in a finite range, 
and nowhere else, then 

1 
b—a 





axxxb 


px(x) = 


E[X] = 5 (a+b) ok = = (b- a) 
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Sinusoidal distribution 


If X has a uniform distribution with b — a = 2m and Z = A cos (X + 0), where A 
and @ are constants, then 


1 
Pz) = — = gos 
TVAE — 2? 
E[Z] = 0 g2-—-—A 
Laplace distribution 
If the continuous RV X is governed by 
a 
= Ż gdl 
Px(x) 2 e 
then 
2 
E|X] =0 o? = p 


Gaussian (normal) distribution 


If X represents the sum of a large number of independent random components, 
and if each component makes only a small contribution to the sum, then 


pe) = ceo ern 
270? 


EX|-m o=o 
(See Table T.6 for gaussian probabilities.) 
Rayleigh distribution 


If R? = X? + Y?, where X and Y are independent gaussian RVs with zero mean 
and variance c^, then 


r 25,42 
Par) =e"? rz0 
g 
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Table T.6 


Gaussian Probabilities 


The probability that a gaussian random variable with mean m and variance c? will 
have an observed value greater than m + ko is given by the function 





SEAT LOS UNT, 


Qi = I. | e "P dy 
2T k 


called the area under the gaussian tail. Thus 


P(X > m + ko) = P(X = m — ko) = Ak) 
P(|X — m| > ko) = 2Q(k) 
P(m < X m + ka) = P(m - ke < X = m) =3- Ok) 
P(|X -m| = ko) = 1 — 20(k) 
P(m -ko «Xm ko) =1- Qk) — Otte) 


Other functions related to Q(k) are as follows: 


k 
erf k Š v | e* dk = 1 — 2Q( V2 k) 


T +0 


A 2 seer 
ecto k Z f e^ dÀ—1 — ert k = 2Q( V2 k) 
T 4k 


& du Fa 
me) * ferna =- ow 
2T 0 





All of the foregoing relations are for k = 0. If k < 0, then 


Q(-|kl) 7 1— Gk) 
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Numerical values of Q(k) are plotted below for 0 = k = 7.0. For larger values of k, 


Q(k) may be approximated by 


1 


A= Vrk 





which is quite accurate for k > 3. 
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10-8 


10-9 


10-10 


10-11 


10-32 
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Table T.7 


Glossary of Notation 





Operations 
z* 
Re [z], Im [z] 
|z| 
Im [z] 
arg z = arctan 
Re [z] 
1 [72 
(v(t)) = lim = | v(t) dt 
Too T 1/2 
F[v(t)] = | v(t)e P dt 


stay - | veal 


U x w(t) = | v(A)w(t — A) dA 


v(t) = + |. MN 


NES 
Ry(T) 

R(T) = R«(7) 
Gf) = F[R(T)] 
E[X] 

E[v(t)] 


Complex conjugate 
Real and imaginary parts 
Magnitude or absolute value 


Angle of a complex quantity 
Time average 


Fourier transform 
Inverse Fourier transform 
Convolution 


Hilbert transform 


Crosscorrelation 

Autocorrelation 

Spectral density or power spectrum 
Mean or expected value 

Ensemble average 
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Functions 
1 err 
Q(k) = — | e ^P dy Gaussian probability 
V 2a k 
expt — e Exponential 
: sin 7t . 
sinc f = Sinc 
qr 
ent = Die Sign 
"B -1 1<0 J 
I bv 
u(t) — b m Step 
LB. l< = 
ft 2 
n(2) = Rectangle 
T T 
Q^ Deeper 
2 
[z] 
a «c 
(t) T MEL Triangle 
0 


jz] > T 


“Equals by definition” 


“Approximately equals” 
ST 
| where t, is arbitrary 
h 
Denoting a Fourier transform pair 


Binomial coefficient 


Text material that may be omitted 
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Solutions to Exercises 


2.1-1 v(t) = 3 cos (270t + 180°) + 4 cos (2715t — 90° + 180°) 


fi) i Pa, 


180° 90° 


Amp. 


180° 90° 


Phase 
e 
~ 
o 


-90° 


2.1-2 
A/S 








P=7+2X57+2xX2?=107 


2.1-3 
224 V(f) afa RUP LE UM T 
h, = € = toa 7 
1 w l]c-(b/oy B -c-(2«xfy 


wore sus z 


b/w 2Ab 





Ae s HM 





T " 24? [0 df 
um MEV GI or le roar abun OB 

or 5 (2rf/b) Tib sin 7/ 

co A? 

2 dt = al CE des 
[lo irae = e| cau 
7 "T " = Ag) = P uy 

22-3 z(t) = V(t) with b = 1 and 2A = B, so Z(f) = Ae uo 
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23-1 


2.3-2 


2.4-1 


2.5-1 


2.5-3 


SOLUTIONS TO EXERCISES 795 


F[v(-1)] = TEST [V CP) + CB] = VAF) - JV) 


Af) = aV) + jVG)] + ax V) -JV4G)1 
= (a, + a) VAT) + (ai — a) Vf) 


Al li: v(t)e P a - | v(t)(—j2nt)e ?"* dt = —j2mS[rv(t)] 
—o9 i d =00 
Thus, tu(t) e Imr qp M) 
A? f 
E EA A 
(A sinc 2Wt)? <> — II (5) eel (5) =<2W \2W 
2W 2W/ 2W \2W 0 if] > 2w 


(a) E v(t)8(t + 4) dt = v(—4) = 49, 


(b) TE S(t + 4) = v(t + 4) = (+ 1* 
(c) v(t)d(t + 4) = v(—4)8(t + 4) = 498(t + 4) 
(d) v(t) * 8(—1/4) = |-4|v(r) * 8l) = 4(r — 3)? 


FI Ault) cos ext] = 2 | : 


1 
=6 — 


— — | 


dv(t) _ 2A (£) - ( z) - («- 2) 
a Ne A0 tt A0 t7 5 


jonfV(f) = 2A sinc fr — Ae"? — Ag Inf 


Vf) = B (cos mfr — sinc fr) 


v(t) 





dv/dt 
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31-1 g(t) = e^" u(t) 


du d ; 1 
A(t) = YRC LZ 4 (WRC t) = &t) — —t/RC. 
Geers E EIU CR OE EO 
j2nfL Jf 





312 WP) = eL AE 


31-3 H(f) = Tsincf T e ?"/7, X(f) = Ar sinc fr 
T « T, Y(f) ~ AtH(f), y(t) ~ Ath(t) 





7 =T, X(f) = AT? sinc? f T e #7, y(t) = ara(! =") 


T >> T,Y(f) =~ TX(f), y(t) =~ Tala) 





f 1 
d = — < 
a \ oe i26 P5 Ifi 30 kHz 


3.21 t{f) = i i 
(- ma) = 2 |f| > 30 kHz 


tg 
1/120 ms 


0 30 kHz 


1/4) H(F)| 


1 20 kHz 
f 


3.2-2 lH. Ff) | 








|f| < 20 kHz 


i RI 


|f| > 20 kHz 


10? 
120 
0 |f| < 30 kHz 


a 10? T = — 7/3 rad = -60° 
2m + l3 
Tf ig v5 ifi 0 kHz 





arg Hef) = —2nf — arg H(f) 











3.3-1 


3.3-2 


3.4-1 


3.4-2 


3.4-3 


3.5-1 


3.6-1 


SOLUTIONS TO EXERCISES 797 


(a) 
Laem = 10 iogo( 





P edo 10? 2 
1 mW 1W 

P 3 
(b) 


1 
| H(£)|* = 1003849 = 19793 = 0.501 = |H(f)| ~ —= 


v2 
(a) 33 dBm — 24 X 2.5 dB = -27 dBm = 10 ^ mW = 2uW 
(b) —27 dBm + 64 dB - (40 — 24) x 2.5 dB 
= —3 dBm = 10 0? mW = 0.5 mW 
Let f, =f, + B/2 = (f; + f,)/2 and let V(f) = 2KII(f/B), so 
HQ) =S IVI d WT Ze 
h(t) = v(t — tj) cos o (t — ty) where u(t) = 2BK sinc Bt 


_ 1 poe 
IH(f)las = 101810 Y By = 10 logio B 

= f 

= —20n logio B forf B 


20 
|H(2B)|as ~ —20nlogi2 = —60n = —20 dB =n = — nmin = 4 


6 
Tmin = 10 us, but the minimum pulse spacing is 30 us — Tmas = 5 us, SO 
1 1 
= ———— = 100kHz, =~ =5 
2X 5 ps 5 pee 
Fl x(t)] = (—jsgnf)X(f) and 


j- a. Hig) = sje 3s 
sx (- L)| = em rout) 


^ 1 
=x) so (- 1) =x0 
Let z(t) = u(t) + w(t) where v(t) = 4 eld pivot 
w(t) = & e Jég jest 
then R(T) = Q since Wy Xm @,, SO 
RÀT) = R(T) + Br) = |$ t |P 


m Swi 2 
+ |fe J$ | Zejo = & COS WoT 
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3.6-2 z(t) = w* (—1t) = w(-2), 


Ry {T) = | vara — À) dar 


—oo 
E, = E, = A?D 
|Ruw(7) |? max = (4^Dy = = E vEw at 
T= —l4 
Ry wT) 
A2D 
-À v(A 
i z(t — A) 2 (A) 
=> 
À T 
T+ T+t+D 0 D -t-D  -t -u*D 0 


36-3 F [v *(-r)] = K * (—r)e ^" dr 


—00 


i 


-fo * (A)e 79^ dÀ 
| 


| v Je “maa | = V*(f) so 


Gof) = #,[R(7)] = F[v(7) «v *(—7)] 
WF)V* (G) = |WADL? 
41-1 Valt) = z(t) + z * (D), z(t) = volt) 
Vaf) = Flz(t)] + 9[z* (t)] where 
F[z(t)] = Vif — fe) 
and ¥[z* (t)] = Z* (—f) = Vg(-f — f) 
so Valf) = Vit — f) Vane fa) 
4.1-2 Hyf) EUROPAS. dice d eom 


II 


Kj, 
Yy(f) = KoXi(f) + jaf | j2rfXip(f )] where 


X(t) = A(t) cos wot => n = 5A,(t) 
cR E 
2mf. dt 
—K, dA,(t) 
2mf. dt 





so Yip(t) = 7 KoA,(t) «| 





yt) = KoA Q), y(t) = 
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4.2-1 








reversals 
2 Py 
42-2 DSB: Sp = 2P,, = 20 W, AL, = = 200 W 
S,/4 
Py, 
AM: P, = 73> = 100 W 2$, = P, + 2P,, = 120 W and 
2 B'S, 
a2, = L2 = g00 W 7 
max SH 


Ac A, 
42-3 x,(t) => cos (ca, — W,)t + 7j 008 (c, + t 





A, A. 3A, 
v; = z COs Ont + D COS QI = ER W mt 


Ug = — Sin cf sin w,,t = 4 sin «t 


4 





cCOo———— —— ——— A 
A(t) = V ÈA. cos ot)? + (A, sin o,t)? = x V9 cos? Wat + sin? wat 
Cc 2 A. 
x V8 cos at 1 = 7E 5 + 4cos2o,t 


() : A,/4 sin ot (= :J 
t) = — — t 
$ TO 3A,/4 cos ot cue 3 


B/S 





43-1 Expanding cos? @ = 2 cos 0 + } cos 36, 
Voure = a(A, cos wt + 1x) + a (4? cos? wt + 23A, cos wt + 3) 
DII L= 


+ al A? $ cos wt + A? } cos 3w,t + 31 A2 cos? wt + 35 AL cos ot +$) 


LLLÓÁÁ——————— See 
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Only underlined terms are passed by BPFs, so 
X(t) = Vous T Vou- = 2ajx(t)A, cos wt = (2a,A,)x(t) cos wt 
44-1 x(t) = $A,A,,(COS Wat cos vt F sin ct Sin ct) 
= $A,A,,[cos (w, — @,)t + cos (w, + W_)t 
F cos (wm, — w,,)t E cos (w, + w,,)t] 
= lALA, cos (e, + wn)t 
A(t) = LA V AZ cos? Opt + A? sin? Opt = }A Ap 





44-2 X(f) = COS Wpf A,/2 x(t) = cos «t = A,/4 [cos(w, — v,)t + cos(w, + xw,)] 

=o f 

fe -fm Jet f ^Ín fe n: Jm 

Difference 
| » x i Ad4 - AJA | AL | 

f f f 

0 fn fe Ín fet fm fe 7 fm fet fn 

X(t) = cos (yt — 90°) 5 X(r)cos (wt — 90°) = 
A 


=E (cos (9, — @,)t + cos [(w, + Omt — 180?]) 


45-1 VA x Uf- CF € £2] 








a 


hf AA A SE ae SA, 


- -f -fi Si-h -fi +h f hth 





4.5-2 Leta = A,A,,/2, so 
A*(t) = (Ajo cos b' + acos wt)? + (Ajo sin o’ + asin wnt)? 
= Agote + 2ALoa(COS Wm ese E + sin w,,t sin $' j and 
cos(v, rre) 
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ay 2a 
A(t) = Aro | 1 + (5) + — cos (wnt F d) 
^ Aro) Aro 


7 Ajo t 3 A. As COS (Wat F Q') 
51-1 x,(t) = A,cos [wt + w,ux(t)t] => 6(t) = 2v ft + fauxlt)t] 
F(t) = ae A(t) = fe + faxt) + foux(t)e 
= f[1 + u COS Ont — po, sin ot] 
so fit) = —f.uoyt sin Omt for uo, t >> 1 and |f(t)| > oo as t— oo. 














Pa f Í 
s12 S(4()]- b aero) - SS (2L) 
7 Lh "m $8 A FT RM uu 
51-3 B = 8 kHz/4 kHz = 2 58 
fa = 30 kHz 
fa = 11 kHz 


Note "folded" terms at 
|11 -— 12| = 1 kHz 


|11 — 16| = 5 kHz f kHz 





52-1 
l D 2M(D) Approximation 
03 30 J2D*1)-26 
3.0 10 2(D + 2) = 10 
30 Ur 2(D + 1) = 62 


52-2 Since H(f + f.) = e ?™"4, we have Ky = 1, K, = 0, 

and tj = 0 in Eq. (12), so A(t) = 

f(t — tj) = B sino,(t — ty) 

= B(cos ct, sin ct — sin cf; cos ct) 

 B(sin Omt — wf; cos c, f) mt) « m 

and y(t) = A, cos (wt + B sin Wnt — Bot cos ot) 
)| 41 and f(t) =f. + Bf, cos Wnt, so 
HIF] = —2«mt[f(t) ^ fe] = —^Bo,t, cos wnt and 


Y(t) ^ A, cos (wt + B sina, — Bo,t, cos Omt) 
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53-1 x(t) = A,cos wt — A. ax sin wt = A, V 1 + ($ ax)? cos [ot + arctan ($ Ax)] 
Thus, d(f) = arctan (gx) = éax(t) - $6àx (0) + 6x (n c 
5.3-2 x(t) — x(t — to) = tox (t) = nens Rc pue (0.(t) + 180°] 


+ 


= to X(t) 
3 Env DC 
ty 277A t 
x block i n s SE 


Delay to 


6; 
5.4-1 1+ cos0; = 2 cos? PES 








it 
A,(t) = A, Va + 26088, = As 4 cos? = 2A, cos S 
sn. 2 sin cos 2 l 0, 
FE = S = fanc so 
1 + cos 6j 6; 2 
2cos— 
2 
y(t) = arctan| tan 35e; 
Envelope detector t 
pe 
12 f, 
T fi 
Phase detector t 


A2 
Frequency detector t 


54-2 Aj, = Am V1 + (f/Bae)” I T WA * T5 ~ 05 


B = 0.5 x 75 kHz/15 kHz = 2.5, M(B) ~ 4.5 
B~2X45 X 15kHz = 135 KHz < B, 








6.1-1 


6.1-2 


6.3-1 


7.1-1 


7.1-2 


SOLUTIONS TO EXERCISES 803 


n=—00 


&0 7. S oe) = (02) oe. = ER) 


Thus, S) = dl 2 = Peter] =f > P(nf,)8(f — nf.) 


n--—oo 


Sample values are identical, so the reconstructed waveforms will be the 
same for both signals. 


jx. 11 1 
lia = 10f, = 80kHz, Biz — = 40kH 
2 OE Fn T. : 





: T. 4 
C, = f, sinc nfjr = PL Tnf;T 
oo 


x(t) = alge + Y = sin anf, cos nos T = Toll + ux(t)] 


n-l 


= Afgro1 + ux(t)] + Y = sin (nmf,ro[1 + ux(t)]) cos nw,t 
n=l 


Jir = 7.0 and 10 < fro < 10.5 with f; — 10.5 = 7 =17 « f: < 17.5 
fir = 7.0 and 30 < fro < 31.5 with 31.5 — ff = 7 = 23 < f? « 24.5 
Jir = 7.0 and 30 < fro < 31.5 with f" — 31.5 = 7 = 37 < ff" « 38.5 
With 1st order Butterworth LPF, spurious rejection is 


1 
20 log L——— = —12.8 dB, —15.3 dB, and 
| V1 + (f/4)* 17,23,37 MHz 
— 19.4 dB 


A : 
Hef.) = 1, Hag( fi) = «jo» z i) where x =" 


c 


1 2 
RR=1 + s(x- 1) = EEEE ope 





20 
"d fir fe 
Bu — = 1 + > 1 sotake — - 20 and fir = 9.5f. 
f. f f 7 


(V2 cos wt)?v, cos wit = 3 vI(1 + cos 2w,t)v, cos wt 
= i vuv cos wt + components at |2f, + fi| 
AM: vuh = 1 + x,(t) + 2x2(t) + 2x,(t)xo(t) + xy(t)x3(t) + xxt) 
intelligible unintelligible 


DSB: v,v3 = x,(t)x3(t) unintelligible 
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1 1 


D 
X 
4 
I 


4 x4 X gg, Br = == = 8 X 750 = 6 kHz 





EL — ——————À 
1/750 


72-3 T, = (—-60)/(—54.5 X 4 X 10°) =~ 2.8 us, 

T,/M = 1/(10 x 8 x 10) = 12.5 us, 

T = 12.5/5 = 2.5 us, tf) €1(125 — 2.5 — 28) ~ 3.6 us 
7.3-1 





£ > 0 forall e 
so e(f) continually increases 
and e,, does not exist 


idi f= ag K= lafl = |a- E E aay 


O  om(Af-K) 








m 





74-1 n, = (37 cm X 40 lines/cm)(59 cm X 40 lines/cm) 








= 3.5 X 10° 
: 1 0.714 a : 
frame 7 42X10 ^ 1X1 MON 
74-2 -—sin ot = cos (ot + 90°) Aca 
Aa K Aw; |ux| < 1, and |ux,| << 1 Ay wat + h 
Thus, A, ~ A«(1 + ux) + Aca cos (ost + $) A ux, 


As + ux) 





SOLUTIONS TO EXERCISES 


81-1 M = 9 equally likely outcomes 
P(A) = 4/9 
P(B) = 3/9 = 1/3 
P(AB) = P(GG + RR) = 2/9 
P(A + B) = 5/9 
84-2 P(D) = 4/8, P(BD) = 1/8, 
P(B|D) = (1/8)/(4/8) = 2/8 = P(B) 
P(D|B) = (1/8)/(2/8) = 4/8 = P(D), 
P(B)P(D) = (2/8)(4/8) = 1/8 = P(BD) 








8.2-1 
Outcome GG GR GY RG RR RY YG YR YY 
Weights 22 2,1 20 -12 -L-1 -10 02 0-1 0,0 


X 20 0.5 1.0 0.5 —1.0 —0.5 10 -05 0.0 


x  -10 -05 00 05 10 20 
Px) 1/9 2/9 1/9 2/9 2/9 19 
F(x) 1/9 3/9 4/9 6/9 8/9 9/9 





P(-1.0 < X = 1.0) = Fx(1.0) — Fy(—1.0) = 7/9 


30/2 1 1 
8.2-2 Per < X <3n/2) = | ao aS 
T 


27 1 1 
P(X > 30/2) = | l—dxc-— 


37/2 2 4 
3n/2 1 1 
Pir < Z = 30/2) = CURL UE 
A T 


4 


82-3 py(x) = 1/4for0 < x = 4,g (z) = z? = dg (z)/dz = 2z 
p{z)=2/2 0«zz2 
=0 otherwise 


83-1 my = [ase =r X= Pelat 
0 0 
oy = V(Amr/3) - m = m/ V3, 
P(|X — my| < 2c) 
= P(r —2n/N3 < X < r * 20/V3) «1 


37/2 1 1 3 
P(r = ZS 32/2) = 39 m) dz. We, 
z T 
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8.3-2 


8.3-3 


8.4-1 


8.4-2 


8.4-3 


9.1-1 


9.1-2 


SOLUTIONS TO EXERCISES 


E[X + Y] 


il 
— 
8 
——— 
8 
en 
+ 
ed 
~~ 
AS 
>o 
aN 
= 
£ 
R 
© 


li 
i 
8 
taj 
ET. 
s 
8 
X 
A 
X 
= 
E 
i—i 


+i y | Px% y) dx dy 


= | m) dx + | ono) dy-X-tY 


®,(27t) = 9 ![a TI(f/a)] = sinc at, so 

P(r) = sinc at|.-,5, = sinc(av/21) 

m = 10* x 5 x 1075 = 0.5, P,(i) ~ e7°5(0.5'/i!) 
| 5° 0.5} 95 

Fj(2) = e( 9 + 2 2 


01 T + e) = 0.986 
o = 8 so9 = m + 050,25 = m + 2.50 
P(9 < X = 25) = P(X > 9) — P(X = 25) 
= P(X —m> 0.50) — P(X — m = 2.50) 
= Q(0.5) — Q(2.5) = 0.31 — 0.06 = 0.30 


: A img? 
| pA) dà = | (iem dk r=0 
oC 


—oo 0 


F,(r) 
Let a = A*/20* so 


rg? 
F(r) = | e€*da-l-e" r=0 
0 


u(t) = E[X + 31] = X 3t = 3t 
R, (ty, t2) = E[X? + 3(t, + t2)X + 9t,t.] 
=X? + 3(t, + )X + Ott, = 5 + 955 
v(t) = R,(t,t) = 5 + 92? 
E[z^(t, t2)] = Elv*(t)) + v(t) x 2v(t)v(t2)] 
= vt) + v5) + 2R (t,t) = 0 
Since v?(t) = R,(0) for all t, 
|R()| = Rie t— 7)| 


x i [v(r) + v'(t — 7)] = R,(0) 
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91-3 Being produced by a linear operation on a gaussian process, w(t) is anoth- 


92-1 


9.2-3 


9.3-1 


9.3-2 


er gaussian process with 

R(t), t2) = El 4u(t,)v(t,) — 16v(t) — 16v(t) + 64] 
= 4R,(t, h) — 16[v(t) + v(5)] + 64 
= 36e lial + 64 

Thus, R,(7) = 36e7™ + 64 and 

w = R,(0) = 100, m, = V R,(-Eoo) = 8, 

oy, V100— 8? =6 


Hence, w(t) is stationary and ergodic. 








RT) = E(v(t)u(t — 7) — myv(t — 7) — myv(t) + m] 
= Rr) — m} = mi mi 
Thus, R,(7) = R(T) + my=> GJ) = Gf) + mêl) 
Let w(t) be a randomly phased sinusoid with A = 1, so 
Gf) = 4 [8(f — f£.) + Of + f.)] and 
GLS) = Gf) * Gf) = 3G S) + GF * £2] 
Gf) = a?D sinc? fD ~ oD for |f| < 1/D. Thus, if B < 1/D, 


Gf) = ?D and R(T) ~ o?D«Be ""^ 


1 
1+ (By I 
—2242 
Pa 2(m4 x 1072) 
3 x 6.62 X 107? 
c, 1.206 mV. h/2kS ~ 8 X 10^ P so 
h| f\/2k7 < 1for|f| s 10? 


x 1000 = L6 x 10$ V?, 


10° 
| G,(f) df ~ 2 X 10°G,(0) = 1.6 x 107° V?, and 


— 10? 


|IHCP) = 1/[1 + (/BY"] and g = |E(0)|* = 1 so 


i MP Em 
UU he Bye Lae 


m[2n TB 


sin(m/2n)  2nsin (7/21) 
and [sin (7/2n) ]/(7/2n) = sinc (1/2n) > 1 as n — œ 
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94-1 Sp, + 174 — 10 logy (5 X 42 X 105) = 50 dB 
= Sp = —51 dBm = 84 X 1075 mW 
Sr = 10145, = 840 kW without repeater 
Sr = 840 kW/(5 X 10°) = 168 mW with repeater 
95-1 (a) o4/A = VN /2E, = 0.1, 
or = VNo/4BE,t = V/Ny2E, = 0.1 
(b) o4/A = V NoB7/A? = V NoBrt/E, = 0.4, 
oT = VNo/4Br Eyr = 0.025 


9.5-2 holt) = (2K/No)[u(t, — t) — uta — t — 7)] 


1 ty tár < t < ta 
0 otherwise 


so, with 2K/No = 1, hy(t) = u(t — t; + 7) — u(t — t4) 
Realizability requires Aj, (f) = 0 fort < 0, so take t; = 7. 











t-t 
hop (t) * x(t) = aa( 2 :) where, at t = ty, 
ta 
A= | A, dÀ = Ayr. 
fT 
101-1 G,(f) for f» 0 
Ny2 Ny2 
Ny ol o 
pum 2 f- f, MHz f. MHz 
G, (0) = 2K No/8 = Ny4 
= 107° = 
101-22 A, =a] T X = 1.3 mV and A? = 2 x 107° 
TT 
sog, =f2— 5 X 10^? = 0.655 mV 


Let à? = (2A, = 2x N, so P(A, > a) = e^" = 0.043 





102-1 


10.2-2 


10.3-1 


10.3-2 


10.3-3 


10.6-1 


SOLUTIONS TO EXERCISES 809 


GÁf) 
GP) 
No 
No/2 Ny2 
0 f,- Wi pa i -W -Wa 0 wa w | 
N = M yw Do 5 = 1250W 
S902 2 iur mec 


S 
(3) = 0.8y, 101og 0.8 = —1dB 
N/p 


S,=1=>A2=S,, P(A, = A) = 0.99 = P(A, > A.) = 0.01 


; S 
Thus, e ^/?N« = e -542N* = 0.0] and (3) = gie —- 41n10- 92 


N/R, 0.01 
ep) E n(L)~ ex 
Ha) | GAf Sel + (f/ Bae)” H Br 285 Bá. 


for B, < |f| < Bjy2 


N, By NBB Br \ NyBW 
Nom Ae pg x 251 = Ner. (Fr) oBa 


PCR 2 2S. 2W/ Sg 


fA N2. SKEM) SrR(PM) 
(Fe) ar T ars We ba = and SFM) = 1W 


S (PM) ( Fe y ( 7.5 J 

2 > ——— ] => 5,(PM) = 130W 
Sr (FM) Bae a X21 RM) 
Br = 5W = Ya = 10 X 5 = 50 


(3) => (Be)! x 1 x so = 2500 ~= 34 aB 
GEN Ni 2 








Tmax 7 Tmin = 2UTo S T, > ur, S T,/2 = 1/4W 

S {Ww w 1B, 

=] =4 ?B 2 Js = 4u? — Sy = c 
(5), (uTo) (e xY H ToBr a xY 2 Ww. xY 
since u € 1, To = TJ2 = 1/2f,, and f, = 2W 
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11.1-1 


113-2 


11.1-3 


112-1 


112-2 


112-3 


SOLUTIONS TO EXERCISES 


: 1 
sinc? at = t so take r = a 


H o 


l 
X MEE 


RI 


, > bee 


S[sinc? at] = 


Q le 


a(£) = 0 for |f| > aso B 2a>rsB 


1 
Pme 2 sne( Z) = O for fn S27 


f 


A? m A? 
Thus, Gf) = rm sinc a $ ru èlf) 


r= A?/2 by inspection of x(t) or integration of G,(f) 
f 


1 
P(f) -in(£ = 0 for |f] >> 


2 PM 2 2 
Thus, G,(f) = E as n(£), r= ra x22 Exe 
b b 


Y, rp 2 2 
Aja 2 2 Vi x 50 = 10 


P, = Q(0.4 X 10) ~ 34 xX 102, P,, = Q(0.6 X 10) ~= 12 x 10? 


P,— 4(P., + P) 13x10? 
whereas P, = Q(0.5 X 10) &3 x 1077 


1 No 
—, =—=N 
2r, y 2T We 


(A/2o)? = A?/4a? = 48p/4Nor, = Sp/Nory = Yo 


(b) Sp = 4A’,7 =3T, = 


P, = 3 X 2Q(A/20) + 2X 1 Q(A/20) = $ Q(4/20) 


A? (+) - 2Sr Sg 
2 , 2c ANg — 2Nor,/2 





SpA PEU) tas 


Dy Qlaz = 0) 


Py la, z-—A) 


Py laz =A) 








-AD 0 A A 





ial | 


] 


= yb 











113-1 


113-2 
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Note that P( f ) has even symmetry, so consider only f > 0. 


Os fs r2 2lr 


2/r 
r'2€ f&r "E 
Q0 f-r "n2 f 
1 [255 2A 2 2 
py) = | *(1-3)a- (1-2) 
20 r r 
2 P 
1-4) DEI Cf) 
Thus, P(f) = Vr 
2 | f | r 
—|1—-—42 5 =|f|<r 
4 á 1/2r 
-r -r2 0 2 r f 
t art , 
pt) = sine? =, p(t) = sino? 7 sinc rt po 


No additional zero-crossings, but | p(t)| « 0.01 
for |t| > 2D. 


o9 


IM) Par] IT dr vim 


—co 





Let Jap = | 


IPC£)] 


YU) = HG) VGA) WU) = veces) 





812 SOLUTIONS TO EXERCISES 
Then Tyg is minimized when V(f) = gW(f), so 
IP(F)| slP(f)| 
HAP) VG) = 87a AT AeA)? = 
i STEATE) VEA EA 
AE |P(F) |? IPF) | VG.) 
z E AAA 
PAH HAA — s|POP IB) 
113-3 
T m, m, my — my 
0 0 0 0 
0 1 1 0 
1 0 1 1 
1 1 0 —i 
0 m, = 0, M9 = 0 
0 m, = 0, mo. = 1 
fL) zm r — n GJA S= 
y(t) = (mi — mi-;) A m, = 1, m = 0 
—A m, = 1, mi.» = 1 
114-1 m, — mj t m, t m, + ms and output = ms 
shift m, m, m; m, ms shift m, m, ms; ma nis 
0 1 1 1 1 1 16 0 1 1 0 1 
1 0 1 1 1 1 17 1 0 1 1 0 
2 0 0 1 1 1 18 0 1 0 1 1 
3 1 0 0 1 1 19 1 0 1 0 1 
4 0 1 0 0 1 20 0 1 0 1 0 
5 0 0 1 0 0 21 0 0 1 0 1 
6 1 0 0 1 0 22 0 0 0 1 0 
7 1 1 0 0 1 23 1 0 0 0 1 
8 0 1 1 0 0 24 1 1 0 0 0 
9 0 0 1 1 0 25 1 1 1 0 0 
10 0 0 0 1 1 26 0 l 1 1 0 
11 0 0 0 0 1 27 1 0 1 1 1 
12 1 0 0 0 0 28 1 1 0 1 1 
13 0 1 0 0 0 29 1 1 1 0 1 
14 1 0 1 0 0 30 1 1 1 1 0 
15 1 1 0 1 0 31 1 1 1 1 1 























Cua wi iia adl 


tese 077 





12.1-1 


12.1-2 


12.1-3 


12.2-1 


1222-2 


123-1 


12.3-2 


SOLUTIONS TO EXERCISES 813 


36,000 
vf, = 36,000 and f, = 2W = 6400 v === -56—v-5 


so g = 2? = 32, f, = rjv = 1.2 kHz 


(a) 4.8 + 6v = 50 dB =v = 8, r = vf, = 80 Mbps 
(b) (S/N) = 48+ 6v — 10 = 50dB =v = 11,r = 110 Mbps 


3 — bit quantizer => 8 levels, with Xmas = 8.75 V => step size = 2.5 V. 
For an input of 0.6 V = x, = 1.25 V = e, = 1.25 — 0.60 = 0.65 V. 
In (1 + 255 X 0.6/8.75) 
— n ccce dep 
In (1 + 255) 
4.601 feeds to a quantizer — x} = 3.75 V. 


With companding: z(x) = s 


x, is then expanded using Eq. (13): 
8.75 
^ d 375/809 — 1] = Q. 
ra 255) — 1] = 0.34 
e, = 0.60 — 0.34 = 0.26 (with companding) versus 


&q = 0.65 (without companding) 
1 + 4q?P, = 10° = 1.259 = P, = 0.065/q” = 107$ 


M = 2, P, = Of V(S/N)g] = 10 5 = (S/N) zg = 4.76? = 13.6 dB 
Eq. (5) gives (S/N)s, = 6(27 — 1) = 12.6 dB 


















B W y 
X476 (Me —-1)—Miz-1-4 e eru EE 
EX Yih 
Thus, (S/N)p, = 3M2:5S, -s(1 + 3I S. 
For WBFM, (S/N)p, = 3(b/2)S, yy = 3 by, 
p ^ wW? W 

W2, = > ——— Wins = —— 

rms S, | a = df = 3 rms \/3 

fAV3 _ 

m 2moW | VA : 


TNS 
oop = —— In 2b = 0.393 VS, 
"Mb 


PCM: (S/N)p = 4.8 + 6.0v + 10 logio $. dB 
DPCM: (S/N)p = G,,, + 4.8  60v' + 10 logy) S, dB 
If G, = 6 dB, then 6 + 60vy' = 60v v' =v — 1 


Sopp ^ In 2b => A 


814 


124-1 


12.5~1 


13.1-1 


13.1-2 


132-1 


SOLUTIONS TO EXERCISES 


One frame has a total of 588 bits consisting of 33 symbols and 17 bits/symbol. 


But, of 17 bits, only 8 are info, so 8 info bits X 33 symbols frame = 
264 info bits/frame. 
Output is 4.3218 Mbits/sec X one frame/588 bits — 7350 frames/sec. 
Voice PCM bits/frame = 30 channels X 8 bits/channel = 240 bits, 
Thame = 1/(8 kHz) = 125 us 

_ 240 +n 


r= $us = 2.048 Mbps — n = 256 — 240 = 16 bits/frame 


a = 1075, 18 
P(0, n) = (1 — a)? = 0.985, P(1, n) = 15a(1 — a) = 0.0148 


15 X 14 
P(2,n) = cue tae o(1 — a)? = 1.04 x 1074 
15 X14X 13 4 - 33 
= = — = 4. x 
P(3, n) 2 rd (1— a) 4.50 x 10 


We see that P(2, n) 2» P(3, n), and P(4, n) will be even smaller, etc. 


Hence, >) P(i, n) ^ P(2, n) 
i=2 


We want R, = rj/r = 0.5, given 2tyr,/k = 2.2 and p = 10a = 0.011 





9 0.989 
- -N: R SS = 0.879 K 
Oo S 49 0080 a 0914 9 
; 9 0.989 
ome 2 e d --—- = f 

Stop-and-wait R} 101422 0.278 Unacceptable 
110 

(Cc, c; 6) =(m m m) 011 
101 


cı = m, 0 m, c = m Om, O0, c = 09m Om, 


mmm; €1€€3 Ww 
000 000 0 
001 101 3 
010 011 3 
011 110 4 
100 110 3 
101 011 4 
110 101 4 
111 000 3 
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Pu Pai Pa | 10 0] 
D "EE E = 0 
1322 S= Y[PT R] = 0 y e xu PP | 
Pis Pq por Pkq | 00 xe 1 
5; 7 yp D yip D OQ y,p; DOGOD: Oy DOD0D OO 
— : 
j — 1 terms 8 — J terms 
1110 
For P? = 0111 
1101 


5, = y, Dy. Dy DOD ys, 5. = OD y: Oy Gy, D yg, 
53 =y Dy, DO Dy, yy 


X; ya Y2 i Ye Ys Y» Y Js Y» Yz- D 


53 $5 51 


132-3 "j^ 21001010 Q,(p) 2 pb - p +p 
CRC-8: G(p) =p +p - p 1 
X(p) = Q.(p)G(p) = p^ + p +p + ph «p» pvp +p +p vp 
Y is received version of X with errors in first two digits, so 
Yip) epo pi d pup * p ep ep hp ep EP 
p + p +p+1 











QE cp ep + p+ PTF OF PE OEP ERT pa PEP TD OFF 
p? +p + p> + p° 
m +p +p + p* +p +p 
11 +p% + pt +p? 
pP +p PP ORP ERP EP 
p *ptptp 
p! ap 
p +p?+ptl1 
p? tp tpcl 


YI F 
S(p) = rem| Bey =p + p + p + 1 + 0 = an error has occurred 
P 


816 


SOLUTIONS TO EXERCISES 


ES 
13.2-4 is 


13.3-1 





14.1-1 


Hos 
Xj =m 


xj" =m @ my 


mt i 
xj" =m 3 © mji 


b= 001 


c=010 


d=011 


e=100 


f=101 


g=110 





h=111 





abce = D'I 


Minimum-weight paths: 


(b) a = 





1 
m €5285X10^-—P,-53x2*5 X a? = 3.5 X 1079 


20s 
1 
Pipe ae = 41x 10° < Pp 
Á V 40 É 


Coding increases error probability when R,d,/2 = 1. 


Br = 0.1f, = 100 KHz, r, = (r/B7) X 100 kHz 


(a) r/Br = l sor, = l00kbps (b) r/Br ~ 2 so r, = 200 kbps 


(c) r,/B1 = 2 log, 8 = 6 so r, = 600 kbps 





mde 


EI 


uas mes o 


asi 28? 


DENS AMO Sy 
soy Se alee d ^ 





SOLUTIONS TO EXERCISES 817 


T 1 ; i 
141-2 = £7 Sh = cos h, = A = sin = 


V2 


1 1 1 
x(t) = Dp IN) SG neun 
ME 1 A 
Q, —0-0c = Gf) 75 n PIG) 73r ne t 


1 Eoad 
Thus, G,(f) = 590) 4 ita 


Gf) AM 8 


141-3 xt) = A, X, [cos (waat) cos (wt + 0) — sin (waat) sin (wt + 8)]pr(t — kT,) 
k 
T 
where a, = xl, pr (t) = u(t) — u(t — Tj), wa = 7 7T" 
b 
so cos (wya,t) = cos wt, sin (wza,t) = a, sin wat. Thus, 
x(t) = >) cos (waat) Pr (t — kT,) = >) cos wat pr (t — kT,) = cos wat, and 
k k 
x(t) = pij sin (waa,t)pr (t TI kT,) = > ay sin Wat Pr (t = kT,). But 
k k 
i . Tt 2S E 
sin wt = sin 77 sin [5 (t — kT,) + km] 


b 


= cos kar sin [F (t — kT,)] = (—1)* sin [ar,(t — kT;)] 


so x(t) = 2 (-1Y'a, sin [mrt — kT;) lpr (t — kT;) 


ey —— — JU 
Q; p(T — kT,) 





818 


142-1 


142-2 


1433-1 


SOLUTIONS TO EXERCISES 


Let V(A) = 5,(A) — so(A) and W * (A) = A(T, — A), so 








ae f veow* o al E 
ki zol "— ^N E = | |(A)|? dÀ 
A e| |W(A) |? dA 9 ugs 


=r | EO sa 
The equality holds when W(A) = KV(A), so 
WT, — A) = Elsa) — 80)] N — 1) — s, — 0] 
h(t) = A.pr(T, — t) cos (2n(f. + fj)(T, — t)] with £.T, = Nofio = i 
= Adi, — 1) — 70] cos [2n(N, + 4) — 20 4: f 





= A, cos [2n(f. + AMO) - wt 7 m). = 
T, 
E, 
since [u(T, — t) — u(—t)] = [u(t) — ut — 75))] = H 5 


E ( x 2) 
AT, f f. 2 


sinc} ————— F sinc 
2 Ty lb 








Thus, 





H(f)| = 


HC ft IH Cf) 





fe n2 F fetrn2 


t 
2 
z(t) = [^ cos («A + 80) KA, cos (wt — w,A) dA KA, — rd 
0 b 


2A, 1 t t 
= -5> | | cos(w,t + 0) dA | cos (2w À — wt + 0) ddr 
T, 21d, 2 
sin (ct + 0) + sin (wt — 0) 


a (at +6) + 
^T cos (w, 2o, 


| Oct«T, 
b 


where cos (w,t + 0) = cos «t cos 0 — sin w,t sin 0 and 


sin (ot + 0) + sin (øt — 0) = 2 sin «t cos 0 


E - aec "i 


E 
E 
zl ia 
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Ac . osO\ . 
Thus, z(t) = — [es 0 cos ot — (sin ge ) sin " and 
T, wt 


C 





Act g 90V? 
ALt) = — 4[cos^ 0 + (sno zum ) 








T, od 
Act 2 sin 0 cos 0 (= 9 j 
T, wt wt 
oa 125 d 
ne y 
1432 A2 x2 x 10 $W. A: Bs <9 x 19-6 Ë 
eec [can , Yo = Ny A? A2 
Ej 1 : 
OOK: 33 = 7 = 25 X 10 5:0 y, = 5, PL d [e Q(V5)] +5 x 107 
c ry 
RSK: È = = 5 x qp =10,P,=4e5=3 x 103 
4 e 2r, E ° So Yb E a 
1 
DPSK: E ue X 10 $so y, x 10, P, = 3e719 2 x 10? 
Az 2r, 


144-1 Let y = w,t + $,so 
4 


4 OMM Ac 
X; = A% cost vy cos y Pi cosy + cos 34) cos y 


A‘ 
= mac + 4 cos 2 + cos 4d) 
where cos 4y = cos (4w,t + 4p) = —cos 4w,t since 4$, = T, 3m, Ta 


144-2 For correlation detection 


(c 1D 
y(t) -| x(A) KA,cos o, A dà, t, (k+1)D 
kD 


For filter detection y(t,) = | x{A) h(t, — A) dA 
KA,cosw,A kD SAS (k+1)D 
+1)D-A]= 
Tnus PUE p | js otherwise 


So h(t) = KA,cos [(k + 1)o,D — ot] Ot -D 
= KA,cos co pp(t) since wD = 2r N, 





E = 34D = z ae ee 
a ~ 2r E A 
N Ny [? 
o° 22 \A(t) |? dt = zf (KA,)? cos? «t dt 
= 0 
No 2 A2No 
= — mE DE =N, 


nivei 





r) 


M — 





1 
) + Benin = 


Y 





Dr 
2 


2650. 


Jay 


SOLUTIONS TO EXERCISES 
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010110 101100 000101 011001 001111 011001 110101 101001 110001 011111 100000 110110 





PN Seq. 
Assume 2nd term does not significantly contribute to the overall error so that with M — 54 users, 
54 — 1 


we have 
1 
1055 = :( 
2 


Multiple users: P, 


15.2-2 


SOLUTIONS TO EXERCISES 821 


But with FH-SS, Pg = 2^ = k = 12—Y = 4096. 
Comparing to Exercise 15.1—2, for M = 54, and the same P,, we require 
Pg — 1000. 


153-1 m, = m, + m; and output = ms 




















shift m m, m, m, ms shift m m, m m, m; 
0 1 1 1 1 1 8 0 0 1 0 1 
1 0 1 1 1 1 9 1 0 0 1 0 
2 0 0 1 1 1 10 0 1 0 0 1 
3 1 0 0 1 1 11 0 0 1 0 0 
4 1 1 0 0 1 12 0 0 0 1 0 
5 0 1 1 0 0 13 0 0 0 0 1 
6 1 0 1 1 0 14 1 0 0 0 0 
T 0 1 0 1 1 15 0 1 0 0 0 

shift m m, m, m, ms shift m m, m, m, m; 
16 ] 0 1 0 0 24 0 1 ] 0 1 
17 0 1 0 1 0 25 0 0 1 1 0 
18 1 0 1 0 1 26 0 0 0 1 1 
19 1 1 0 1 0 27 1 0 0 0 1 
20 1 1 1 0 1 28 1 1 0 0 0 
21 0 1 1 1 0 29 1 1 1 0 0 
22 1 0 1 1 1 30 1 1 1 1 0 
23 1 1 0 1 1 31 1 1 1 1 1 








The above output occurs with all 1s as initial conditions. Any other set of 
nonzero initial conditions will produce a delayed version of the above out- 
put. Therefore, this register configuration only produces one unique 
sequence. Any n-bit register configured to produce a ml sequence will 
only have one unique output sequence regardless of initial conditions. 


16.1-1 P + P, = 1 — P, so 2P, = 2P, = 1 — p and 


1 1-p 2 1 | 
H(X) = 2+2 1 = plog~ + (1 — p)|log 
(X) plog- p Eq IPIE (1 — p)| log 











+ log 2| 





1 1 
= plog- + (1—p)lo +(1-p)=Qp)+1-— 
pug (1 = p) EFF (1 — p) (p) p 


H(X) | max = log M = log 3 = 1.58 at p = 1/M = 1/3 
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déxed o 


16.1-2 


didai 


x 


i P; 1 2 3 Codeword 


N | 





Ny,-ixic-ixi-cixi-i- 
ee oe 
16.1-3 From Table 16.1-5 with p = 0.9 we have the data compression 
N/E = 0.50 so r,/r = 0.50. But R = rH(X) 5 r,, so 
H(X) = r,/r = 0.50 bits/sample. 
1 
162-1 H(Y | X) - P P los l 
(Y | X) cal 6| x1) log PO, | xj) (y2 | xi)log - P| XP 


+ P(x)| Por | xi) a + P(y2 | x2) log 


= 
l-@ 


= O(a) 


R3 





1 1 
= pla — a) log eme t alos +| 





1 
+ (1 - p| alog} + (1 — a) log 





1 
= a log gq + 1 ~ a) logs 


16222. P(xyj) = P(x)P(y;) Pily) = P i5) P(yj) = P(x) 
Thus, H(X | Y) = 2, PG)PO/) los gc 
1 
= [Er] > Po) log m = 1X H(X) 
so I(X; Y) = H(X) — H(X|Y) = 0 
163-1 (a) p(x) = 0 for |x| > M so 


M M 
r= | PO) log C a and | pisci f paci 


Inpt+l 
E à =0=> Inp=A,ln2-1l>p=e (iin 2-1) — constant 


1 M] 
Thus, p(x) = 2M —— for -M < x < M, and H(X) = | 2M —— log 2M dx 


= bs 2M 


16.3-2 


16.4-1 


16.5-1 


SOLUTIONS TO EXERCISES 823 


(b) p(z) = 1/2KM for -KM < z < KM so H(Z) = log 2KM 
But dz/dx = K so H,(Z) — H(X) = —log K and 
Hy (Z) — Hys(X) = log2KM — log 2M — log K 

= log [2KM/(2M X K)] = 0 


(a) R = rlog64 = Blog (1 + S/N) =r x (3 X 10° log 1001)/6 
= 5000 symbols/sec 

(b) S/NoB = 10 = S/N, = 3 x 10° x 10° = 3 x 10° 

B = 1 kHz: 


C = 10? log (1 + 3 x 109/10) = 12 x 105 r = 1.2 X 104/6 = 2000 


Bow: 


C4 = 1.44 X 3 X 10° = 4.32 x 105 =r < 4.32 x 10°/6 = 720,000 
(a) Jsi? = G V2? x2236 ¢ = 51/6 


2 
Q = E dt = 3i dt— 6 g, = sy — 66, 
o 





le^ =36 ¢2= 8/6 
2 
=@+6=72 lls 


(b) 


$5! 


























2 4 ; 
= | GV3 a =18x4=7 SER 
0 


Ee r= 1:2 
—-aq + (2a} i=3,4,5,6 
E = [2 X œ + 4 X 5a?]/6 = 11a?/3 


= = zo Nod = of e) 
VNJ2 NUM T VIIM 


Fori = 1,2 








P(e | m) = | nito dti | mp2 48; = (1 ~ 20) = a) 





Fori = 3,4,5,6 P(c | m;) = | P&(P1) ap, | Pa(B2) ap, EE (1 7 qy 


P, = 5 [2(1 7 24)0 — a) + 4(1 - ay] 2 $0 — 7a + 47) 
Thus, P, = 1 - P, = (74 — 49”) 


824 


SOLUTIONS TO EXERCISES 





sa ARI, ARS, Akf 9, |, 

BU) = TR pry? R? -4( 3) 
RR t jfR) _ RA * jf) a hpl ETS 

20) = RGR ep ANPR 














4+ f?’ 
EM L2 

2 = 2.2 us Sia +f?) 
vf) = |Z) VES) = 4kR 4+f? 

HE in cy py ces oes 
"T Awe4zmp 4 2 p ,I£9, — g, = J, then 
nf) = ko. 

A-2 (a) N, = 108k(9, + 9,) x 2 x 10° 
= 2 x 10 x 4 x 1072 297 = 49 x 10 
" 


so (S, + 9,)/9 
(b) F = 9/9, 


o 


-5-95,-49,F-1-49y94-5 
5,5; = J, + 9, = 69, = 1740 K 


14.5 
A-3 With FET: J, = 9 + 100 + 1.8 + 2.0 = 12.9, Ty =42.9K 


14.5 1.05 x 1860 
i tJ, =9+- > — = 28.7, Ty =58.7K 
Without: J, = 9 100 100 N 8 


Note that FET increase (S/N)4 by 58.7/42.9 = 1.37 ~ 1.4 dB 





an dui j " — | 


, icu 


AM 





3 
* 





y N 


Answers to Selected Problems 


Selected answers are given here for problems marked with an asterisk (*). 
21-8 | 0.2347, 02447, 0214? 

22-4  —K(A[mf )(sinc 2fr — cos 2«rfr] 

22-6 50%, 84% 

2.2-10 AW[sinc (QWt — 3) — sinc (2Wt + 4)] 

23-1 2Az sinc fr cos 2nfT 

23-6  (1/]a]) V(f/a)e i" 

23-313 (j4Abf)/[b? + Qf Y 


24-2 y(t) =0 pps 
= AP/2 0crt«2 
= 2A 2<t<3 
-(A/2[4 - t- 37] 3«t«5 

24-7 y(t) =0 t<0 


= [ab/(a - b)V[e™™ —e*] t»0 
t 
24-14 y(t) -i sins (4) (Hint: Use duality) 
25-5  2Ar sinc 2fr e 7?" 
25-11 2Ar sinc fr(1 + cos 4zfT) 


25-13 4 
31-3  AH(f)e Pre 


31-9 y(t) = 2B | x(A) da 


1 1 1 
31-14 h(t) = yx 90 Si e "yr, g(t) ^E e "Fur 


32-3 y(t) = 1.28 cos (wot + 72?) + 0.31 cos (Bwt + 45?) 
+ 0.14 cos (Swot + 31°) 


32-8 y(t) = 75) dx oT att — 2T) 


3.3-1 £, = 22km, g, = 56 dB, g, = 34 dB 

33-6 r=0.9m 

34-2 h(t) = Ké(t — tj) — 2BK sinc B(t — tz) cos w(t — ta) 
34-8 g(t) = 1 — e "(sin bt + cos bt) for t = 0, t, ^ 1/2.8B 
35-2 d(t)= Iim(A/7)In | Qt + 7)Qt — 7)| =0 


35-5 x(t) = 4 sin wot + $ sin 3ogf + 5; sin Swot 


A \? 2 2 
36-5 Gf) = (4) n(-4), R(t) = (4) sine 2Wr, E, = A 
3.6-10 R(T) = A?/2, P, = A*/2, Gf) = (A?/2)8(f) 
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4.1-10 
42-3 
42-9 
43-2 
43-7 
44-7 
44-12 
45-5 
5.1-4 
5.1-10 


52-5 


5.2-9 


5.2-15 
5.3-5 
§.3-11 
5.4-5 


5.4-10 


6.14 
6.1-15 
6.1-17 
6.1-20 
6.2-3 


6.3-1 
7.1-1 
7.1-3 
7.1-9 


7.1-12 
7.1-14 


ANSWERS TO SELECTED PROBLEMS 


U(t) = 400 sinc 400r e-77!9* y(t) = 800 sinc 400 cos 2771001, 
U,(t) = —800 sinc 400r sin 2771008 


Yep(t) = a(l — e^?) cos ot u(t) 
AM: B, = 400, Sr = 68; DSB: B, = 400, S; = 50 
5€ f.« 50 
K = Vb/a 
f. > 200 
Sp = 7A2/4, Br = 400 
x(t) = (A,/2)[cos ot cos wt — 2a sin wnt sin w,t] 
a = i no distortion; a = 1 or a = 0 produces SSB (max distortion) 
641) = 2n|[ft + (f ~fit?/T] 
F(t) = 40 + 40 cos 277208; Sp = 6441.5 
Br = 10", fy = 5 x 10” 
yet) = Adcos wt ~ da(mf-/Q)%(t) sin aur]; 
$(t) = arctan [6 a(nf./O)x(t) ] 
f. < (f - 2W)/9 
Signal has low frequencies boosted. 
K, = AJ2 V2 f Ky = A8 V f?; fa/fa < 0.08 
S, will increase dramatically and will have a larger impact on DSB. 
P,,/Avnax = $,/4 for DSB however P,/A2,, = S,/16 for AM. 
y(t) = {(1 + pcos[ p(t) — 6(0])9(0/2v 
+ (p + cos [Ø(t) — 82) ])pfi}/{1 + p? + 2p cos [$(t) — 642]) 
K, = 2 and K, = 7/2 
fo 12-MEZ, Woresampting = 137.5 MHz 
(a) 1.64%, (b) -14.3% 
(a) 100 Hz, (b) 200 Hz, (c) 400 KHz 


(a) H(f) = sine fr eI", X (f) = P(f)Xf) where Xf) = f, >) H(f — nf) X(f — nfs) 


(b) Half) = Ketsin? fr, |f|xW 

Br = 400 kHz 

fip Z 532.5 kHz, fio = 1072.5 to 2132.5 kHz, 10 kHz < Ber < 1065 kHz 

C = 6 to 25.6 nF, 19.3 to 3,506 nF 

(a) 50-54 MHz, 50.910 to 54.910 MHz 

(b) 50-54 MHz, 64 to 68 MHz 

(a) 2 MHz at 0 dB, 2.9 MHz at — 10 dB, 4.550 at —23.1 dB, 5.365 MHz at —25.4 dB 
(a) —24 dB, (b) —35.6 dB 
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7.2-4 
72-11 
72-13 
7.2-18 
7.3-5 
7.3-8 


7.3-13 


7.4-4 
7.4-6 
$1-1 
8.1-5 
8.1~11 
8.2-1 
8.2-5 
82-9 
$.2-15 
8.3-1 
8.3-5 
8.3-9 


8.3-15 


8.4-1 
8.4-5 
8.4-9 
8.4-17 
8.424 
9.1-1 
9.1-7 
9.1-10 
9.2-1 
9.212 
9.3-3 
9.3-7 


9.3-10 
9.314 


ANSWERS TO SELECTED PROBLEMS 


B, = 0.76W, Br = 17W 

r = 150 KHz, T = 2 us, Bp = 250 kHz 

(a) Br = 250 kHz, (b) Br ~ 250 kHz 

M = 28 

cos 6,(t) = cos [(w, + w) + bp + Q; + 90° — &,] 

f, 5 kHz, 50 kHz 

(a) A, a ees ee (b) K = 2fa 
BON el BY 

n, = 25.921, B = 805 kHz 

n, = 2.18 X 10°, Tine = 64 us, B = 4.99 MHz 

P(A*B) = 2/12 

P(C) = P(A) + P(B) — 2P(AB) 

P(all tails) = 11/24 

P(X = 2) = 0.4 

K — 0.01 

pz(z) = e €*9u(z + 5) 

prO) = e u(y) 





my = l/a 
my = &a 
B = -mx 


aw) = (1-32) 


P(i < 3) = 56/1024 

P(I > 1) = 0.264 

P(X > 20) = Q(0.5) ~ 0.31 
E[Y] = e% e" 

mz = 0, o2 = 100 

WD = (e = 1) 

Rolt 12) = 0? sin ext, — t) 
26) 5.0 

(v(t)) = +3 

Ral) = FP (U2/)647)] 
y = Not/2 

y. = NyR/AL 


z= ere 16 uv 
By = Va[2 V2a 
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9.4-1 
9.4-5 
9.4-9 
9.5-1 
9.5-5 


9.5-8 


10.1-3 
10.1-7 
10.1-10 
10.1-16 
10.2-1 
10.2-5 
102-11 
10.3-1 
10.3~6 
10.3-10 
10.4-1 
10.4-5 
10.6—1 
11.1-7 
11-11 
112-1 


11.2-4 
112-8 
11.2-13 
113-3 
11.4-1 
12.1-1 
12.1-5 
12.1-10 
12.1-13 
12.2-1 
12.2-4 


ANSWERS TO SELECTED PROBLEMS 


(S/N)p, = 61 dB 
(S/N) p = 25,/3LNW 

Hus = 5 

(o4/A)? = kT yByt/E, = 0.4 
E,= 5X10% 


T 





G(f)-N,-701 a f/f. = +405 
oy= V8—2rT-—13 

R(T) = NoBr sinc BrT cos wT 
Ran (T) = —mq NB? sinc? Byr 
(S/N)p = 50 dB 

(S/N)5 = y/2 for both channels 

gr = y/Ya ~ 1500 = 32 dB 

FM:Np = NoW?/3Sp = 1.67 x 10? 
(S/N) p = 290 = 24.6 dB 

(S/N)p, = 2030 = 33 dB 

(c) Sp ~= 200 W 

(S/N) = 90 = 19.5 dB 

(S/N) p = 2.78 X 10° = 54.4 dB 

(a) B = 160kHz, (b) M=8 

r = 0.7B 

(S/N)z = 192, P, = 6 X 107° (polar) 
ena 1 (A+2e 





(a) P.=50 20 159 A ) P,=3.7 X 1074 and 3.4 x 1075 Gf e = 0) 
P, = 1.5 X 1077 (regenerative), P, = 1.2 X 107? (nonregenerative) 
Main = 16, Sp = 670 pW 
(a) 3kpbs, (b) 4kpbs, (c) 4.8 kbps 
scrambled output: 010010111001100, dc value: 0.47 (scrambled), 0.80 (unscrambled) 
v = 3, f = 33.3 kHz, n = 3 
M =3,v=5, f, = 64 KHz 
q = 4096, A = 0.488 mV, (S/N)p = 74 dB 
24 Mbits 
M = 8,v = 2,q = 64, Sp = 567 mW 
y = 22.8 dB (PCM), y = 50 dB (analog), advantage: 27.2 dB 





12.3-2 


12.3-7 


12.3-10 
12.4-1 
12.5-1 
12.5-4 
13.1-1 
13.1-5 
13.1-10 
13.2-13 
13.2-19 
13.223 
13.3-6 


133-11 


13.3-13 


13.4-1 


14.1-1 


14.1-14 
14.2-4 
142-6 
142-14 
143-1 
143-4 
143-8 
144-1 
144-3 
144-6 
144-11 
14.5-1 
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0.372 
Josz 


W,ms = 1.3 kHz, K = ———— —— 
P d 2 arctan (W/fo) 


n= 1: c, = p,,G, = 10.1 dB, n = 2: c} = 0.9744, c; = —0.0256, G, = 10.1 dB 
5.9 Gbits 


N = 4, efficiency = 7.8% 

247 secs/page (both B channels used) 

P(no errors) = 0.6561, P(detected error) = 0.2916, P(undetected = = 0.0523 

(31, 26): y, = 7.0 dB, (31, 21): y, = 8.2 dB, (31, 16): y, = 8.3 dB, uncoded: y, = 10.5 dB 
= 107°, N = 10, r, = 269 kbps 

Odp) =P +p+1,Cp)=0+0+p+1,X=(1010011) 

(a) a = 5 X 1074, P,, = 1.5 X 1076 

X-—(1000101101010100111) 

X- (101 100 010 000 100 000 011 000) 


(a) minimum weight path: abc = D'I, d, = 3, M(d) = 1 
(b) T(D, I) — 





I 
To p, (9 Ea (9): Poe = 80°, Eq. (10): Pj, = 80°? 


Y+E=00 11 01 01 11 11 10 11 
M=0 1100100 
(a) x = {8;27,51} Sy = - 3, 6} with pg = 55,e = 7,d = 23 
: 
x= To (M - 1)(2M - B x = 1/2 (M = 2), Š$ = 3/4 (M > 1) 
r, = 385 bps 
P, = O(V1.216y,) 
0, < arccos (3.74/4.27) = 29° 
(a) A, = 0.00133, (b) A, = 0.0023 
y, > 10.9 dB, Pa < 3 x 1074 
(a) P, = 3.4X 1075, (b) P,— 176 x 10 * 
(a) 11.8 kbps (b) 23.5 kbps (c) 28.4 kbps 
(a) y, = 10.5 dB, (b) y, = 12.8 dB 
(a) 16-QAM with y, = 14.4 dB, (b) 16-PSK with y, = 18.4 dB 
5.2 
(a) P, = 2.32 X 107, (b) P, = 1.8 X 1079, (c) P, = 0.4, (d) P, = 0.017 
P, = 1.6 x 10^!!, output symbol rate does not change 


830 


14.5-3 


15.1-1 
15.1-5 
15.2-1 
15.2-4 
15.3-5 
154-3 
16.1-1 
16.1-4 


16.1-9 


16.1--16 
16.2-8 
16.3-1 


16.3-5 


16.3-10 
163—14 


16.4-5 


16.5-1 


ANSWERS TO SELECTED PROBLEMS 


Xx, = {00, 01, 10, Ol, 11, 00) 
y3y2y, = (000, 100, 011, 010, 111, 111} 
(a) W, = 203 kcps, (b) By = 0.41 MHz 
W, > 487 Kcps 
Pg = 2! = 256, B, = 768 kHz 
P, = 495 X 10? 

(0000 0000 1001 0100 1001 1110 1010 
Tos = 0.51 secs, Oacg = 0.38 secs 

I(not F) = log 5/4 = 0.322 bits 

H(X) = 1.94 bits 


1 1 2 1 
H(X) = 3 log 3 a ben + (2 - p Jio 5 
rm 





p 
H(X) — 0.811 bits 

C, = 0.577 bits/symbol 

H(X) = log V 12S 


p(x) = > etul) H(X) = log em 


$m 10- "(258/400 22 1) 
Sr = yLN\W 


(a) z 700.5) & = (52) 


20 9E 16 9E 
Reo seh 
u2(f) = 4R kT, + 4R kT, 

(f) ~= 4(r/2)kT, r= kF/ql 
F = 26 
T s 103K 





110), |R.(7) |max = 0.29 


| 2 v2 8 
s= V2 d NEP T ENT 








SUPPLEMENTARY READING 


Listed below are books and papers that provide expanded coverage or alternative 
treatments of particular topics. Complete citations are given in the References. 


Communication Systems 


The following texts present about the same general scope of communication sys- 
tems as this book: Schwartz (1990), Ziemer and Tranter (1995), Couch (1995), 
Roden (1996), Lathi (1998), and Haykin (2001). A somewhat more advanced 
treatment is provided by Proakis and Salehi (1994). See also Kamen and Heck 
(1997) or Proakis and Salehi (1998) for additional MATLAB material. 

Bellamy (1991) provides details of digital telephony. Optical systems are dis- 
cussed by Gagliardi and Karp (1995) and by Palais (1998). 


Fourier Signal Analysis 


Expanded presentations of signal analysis and Fourier methods are contained in 
Lathi (1998) and Stuart (1966). Two graduate texts dealing entirely with Fourier 
transforms and applications are Bracewell (1986), which features a pictorial dic- 
tionary of transform pairs, and Papoulis (1962), which strikes a nice balance 
between rigor and lucidity. 

Advanced theoretical treatments will be found in Lighthill (1958) and Franks 
(1969). The article by Slepian (1976) expounds on the concept of bandwidth. 


Probability and Random Signals 


Probably the best general reference on probability and random signals is Leon- 
Garcia (1994). Other texts in order of increasing sophistication are Drake (1967), 
Beckmann (1967), Peebles (1987a), Papoulis (1984), and Breipohl (1970). 

The classic reference papers on noise analysis are Rice (1944) and Rice 
(1948). Bennett (1956) is an excellent tutorial article. 


CW Modulation and Phase-Lock Loops 


Goldman (1948), one of the earliest books on CW modulation, has numerous 
examples of spectral analysis. More recent texts that include chapters on this sub- 
ject are Stremler (1990), Ziemer and Tranter (1995), and Haykin (2001). 

Detailed analysis of FM transmission is found in Panter (1965), a valuable 
reference work. Taub and Schilling (1986) gives clear discussions of FM noise and 
threshold extension. The original papers on FM by Carson (1922) and Armstrong 
(1936) remain informative reading. 

The theory and applications of phase-lock loops are examined in depth in 
Brennan (1996), which also includes a discussion of noise in PLLs. 
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SUPPLEMENTARY READING 


Sampling and Coded Pulse Modulation 


Shenoi (1995) presents sampling and digital signal processing with emphasis for 
telecommunications. Oppenheim, Schafer, and Buck (1999) is a classic book that 
covers sampling and digital signal processing. Ifeachor and Jervis (1993) presents 
many of the practical aspects of sampling. Other discussions of sampling are in 
papers by Linden (1959) and Jerri (1977). 

Oliver, Pearce, and Shannon (1948) is a landmark article on the philosophy 
of PCM while Reeves (1965) recounts the history of his invention. The book by 
Cattermole (1969) is entirely devoted to PCM. Jayant and Noll (1984) covers the 
full range of digital encoding methods for analog signals. 


Digital Communication and Transmission Methods 


Undergraduate-level texts that cover digital communication are Gibson (1993), 
Couch (2001), and Haykin (2001). Graduate-level treatments are given by Proakis 
(2001), Sklar (2001), and Lindsey and Simon (1973). Also see the books listed 
under Detection Theory. 

The following references deal with specific aspects of digital transmission: 
Fehr (1981) on microwave radio; Mitola (2000) on software radio; Spilker (1977) 
and Sklar (2001) on satellite systems; Dixon (1994) and Peterson, Ziemer, and 
Borth (1995) on spread spectrum; Tannenbaum (1989) and Stallings (2000) on 
computer networks; Lewart (1998) on modems; Ungerboeck (1982), Biglieri, 
Divsalar, McLane, and Simon (1991) and Schlegel (1997) on trellis coded modu- 
lation systems; and Rappaport (1996) on wireless communications. 

Of the many papers that could be mentioned here, the following have special 
merit: Arthurs and Dym (1962) on optimum detection; Lender (1963) on duo- 
binary signaling; Lucky (1965) on adaptive equalization; Gronemeyer and 
McBride (1976) on MSK and OAPSK; Oetting (1979) on digital radio. 


Coding and Information Theory 


Abramson (1963), Hamming (1986), and Wells (1999) provide very readable 
introductions to both coding and information theory. Also see Chaps. 4—6 of Wil- 
son (1996) and Chaps. 4, 7, and 8 of Lafrance (1990). Mathematically advanced 
treatments are given by Gallager (1968) and McElice (1977). 

Texts devoted to error-control coding and applications are Wiggert (1978), 
Lin (1970), and Adámek (1991) at the undergraduate level, and Berlekamp (1968), 
Peterson and Weldon (1972), Lin and Costello (1983), Sweeney (1991) and 
Wicker (1995) at the graduate level. A history of coding and a treatment of code- 
breakers and the present encryption systems is given in Singh (1999). 

Introductions to information theory are given by Blahut (1987) and Cover and 
Thomas (1991). The classic papers on the subject are Nyquist (1924, 1928a), 
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Hartley (1928), and Shannon (1948, 1949). Especially recommended is Shannon 
(1949), which contained the first exposition of sampling theory applied to commu- 
nication. À fascinating nontechnical book on information theory by Pierce (1961) 
discusses implications to art, literature, music, and psychology. 


Detection Theory 


The concise monograph by Selin (1965) outlines the concepts and principles of 
detection theory. Applications to optimum receivers for analog and digital commu- 
nication are developed in Sakrison (1968), Van Trees (1968), and Wozencraft and 
Jacobs (1965). The latter includes a clear and definitive presentation of vector 
models. Viterbi (1966) emphasizes phase-coherent detection. Tutorial introduc- 
tions to matched filters are given in the papers by Turin (1960, 1976). 


Electrical Noise 


There are relatively few texts devoted to electrical noise. Perhaps the best general 
reference is Pettai (1984). Useful sections on system noise are found in Freeman 
(1997), Johns and Martin (1997), and Ludwig and Bretchko (2000). Noise in 
microwave systems is described by Siegman (1961) using the informative 
transmission-line approach. Electronic device noise is treated by Ambrózy (1982) 
and Van der Ziel (1986). 


Communication Circuits and Electronics 


The design and implementation of filter circuits are detailed in Hilburn and John- 
son (1973) and Van Valkenburg (1982). Recent introductory treatments of commu- 
nication electronics are found in texts such as van der Puije (1992), Tomasi 
(1998), and Miller (1999). More advanced details are given by Clarke and Hess 
(1971), Krauss, Bostian, and Raab (1980), Smith (1986), Freeman (1997), Johns 
and Martin (1997), and Ludwig and Bretchko (2000). 
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Nyquist pulse shaping and, 461—462 
inverse Fourier transform, 34 
inversions, 438—439 


jamming. See also interference 
DSS and, 672-679 
FH-SS and, 679-684 
jitter, 443, 478 
Johnson, J. B., 12 
Johnson noise, 372 
joint probability, 316, 329-330 


Kerr factor, 290 
kinetic theory, 372 
Kraft inequality, 706, 718 


Lagrange’s undetermined multipliers, 724 
Lamarr, Hedy, 672 
LAN (local area networks), 538 
lands, 522 
Laplace distribution, 333, 504 
Laplace transform, 38 
LC parallel resonant circuit, 208 
LFE (low-frequency effect), 300 
limiters, 205-207 
linear block codes 
cyclic, 567—573 
M-ary, 573 
matrix representation of, 560—564 
syndrome decoding and, 564—567 
linear distortion, 90—94, 202-205 
linear envelope detector, 177 
line-of-sight ratio propagation, 8, 106 
line spectra. See also spectra 
convergence conditions and, 29 
Fourier series and, 25-32 
Gibbs phenomenon and, 31 
impulses in frequency and, 62 
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Parseval's theorem and, 31-32 
periodic signals and, 23-24 
phasors and, 19~23 
loading effects, 86 
lock-in, 278-281 
LOH (line overhead), 535 
log amplifiers, 160 
logarithms, 100 
loop gain, 280 
lowpass equivalent spectrum, 146 
lowpass equivalent transfer function, 147 
lowpass-to-bandpass transformation, 147 
LPC (linear predictive coding), 520-522 
LPF (lowpass filters), 84, 109, 119 
color signals and, 299 
correlative coding and, 471-472 
digital signals and, 436 
DM and, 514 
DSS and, 675-676 
ideal, 57-58 
matched, 454-457 
PCM and, 495-496 
white noise and, 441 
LT1 (linear time-invariant) systems 
block-diagram analysis, 85-88 
frequency response and, 80—85 
modulators and, 158 
superposition integral and, 77-80 
lumped parameter elements, 77—78, 83 


MA (multiple access), 10, 107—108, 678-679 
Maclaurin series, 60 
majority logic gate, 591 
MAP (maximum a posteriori) receivers, 
593—594, 741-747 
M-ary coding, 440, 573, 618 
ASK waveform and, 614—617 
BSC and, 719-722 
error probabilities, 457-461 
FH-SS and, 680-684 
FSK and, 619-622 
memoryless channel and, 702, 705-709 
optimum digital detection and, 740-754 
PSK systems and, 646-650 
QAM systems and, 650-653 
systems comparison and, 653-655 
TCM and, 655-665 
matched filters, 388-391, 454—457 
matched loads, 374 
mathematics, 13, 18. See also coding; 
equations; information theory 
average code length, 705 
Bayes’ theorem, 318 
BCJR algorithm, 594 
Bessel functions, 191 
binomial distribution, 337-338 
block diagram analysis, 85-88 
Boltzmann constant, 372 
Butterworth polynomials, 114 
Carson’s rule, 201 
Chebyshev’s inequality, 332-333, 721 
convolution, 52-58 
correlation, 124—130, 353-356 
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mathematics— Cont. 
cosine function, 20, 617, 623 
decision function, 744 
differentiation theorem, 51, 204 
Dirac delta function, 58—61 
duality theorem, 42-43, 48 
Euler's theorem, 19—20, 49 
Fourier series, 25-32 
Fourier transforms, 33-52, 80-85 
Galois fields, 568 
Gaussian PDF, 339-342, 344-345 
Gaussian process, 362 
generalized functions, 58-61 
generator polynomial, 568—569 
Gram-Schmidt procedure, 738—740 
Hilbert transforms, 120—123, 371 
horseshoe function, 703 
integration theorem, 51 
interchange of integral operations, 42 
Laplace distribution, 504 
Laplace transform, 38 
logarithms, 100 
matrices, 469, 519, 560—564, 589—590 
mean, 105, 331, 335-336 
modulation theorem, 49 
modulo arithmetic, 479—481, 602—603 
orthonormal basis functions, 738 
parabolic function, 414 
Planck constant, 372 
Poisson distribution, 338—339 
probability and, 311—350 (see also 
probability) 
Rayleigh PDF, 342-344 
Rayleigh's theorem, 40—42 
rectangular pulse train, 27—29 
sampling theory, 26, 232-245 
scalar product, 736 
Schwarz's inequality, 736—737 
signum functions, 64-66 
sine function, 20, 26, 623 
source coding theorem, 706 
spectral density functions, 130-135 
statistical averages, 330-337 
step functions, 64-66 
triangular function, 52, 736 
vectors, 741—747 
vestigial symmetry theorem, 462 
Viterbi algorithm, 586-589, 658 
Wiener-Kinchine theorem and, 362-363 
matrices, 469, 519 
block codes and, 560—564 
decoding and, 589-590 
generator, 561 
parity check, 564 
maximum information transfer, 717 
maximum likelihood detection, 741—745 
maximum phase deviation, 190, 202 
maxiumum likelihood decoder, 565, 583, 585 
maxiumum phase shift, 185 
mean, 331, 335-336 
mean time delay, 105 
memory channels, 575, 709-713 
memoryless channels, 702, 705-709 


messages. See signals 
metrics, 587 
microwaves, 13 
minimum weight nontrivial path, 580 
mixing, 172 
ml sequences, 686—688 
models. See mathematics 
modems, 665 
modified duobinary signaling, 474 
modulation, 3. See also specific types 
analog pulse, 426-429 
applications, 7-10 
balance and, 161-162 
complex, 48-50 
cross talk and, 267—268 
demodulation, 4, 6 
double-sideband AM, 152-158, 164—172 
exponential CW, 183—230 
index, 152, 190 
linear CW, 141-182 
methods of, 6-7 
multitone, 196—199 
periodic, 196-199 
product modulators and, 158—160 
random signals and, 367-368 
square-law modulators and, 160-161 
suppression effect and, 419 
switching and, 162-163 
systems comparison and, 653—655 
theorem, 49 
tone, 189-196 
modulo arithmetic, 479—481, 602—603 
moments, 332—333 
monochrome signals, 292—294 
monotonic functions, 327-328 
Morse, Samuel, 2, 12, 708 
MPEG (Motion Picture Expert Group), 302 
MSK (minimum shift keying), 622-626 
MTSO (mobile telephone switching office), 
693 
multipath distortion, 96 
MUX (multiplexing), 10, 149 
color signals and, 298-299 
cross talk and, 276-277 
data and, 535-537 
digital, 526—537 
DSL and, 530-532 
frequency division, 266—271, 277-278 
guard times and, 276-277 
hierarchies and, 527—530 
ISDN and, 532-533 
quadrature carrier, 271 
SONET and, 533-535 
time division, 272—278 
multitone modulation, 196-199 
multivariate expectations, 334—336 
mutual information, 713-717, 725—726 


NAK (negative acknowledgement), 556-559 

narrowband ISDN, 533 

narrowband tone modulation, 190 

NBFM (narrowband frequency modulation), 
188-191, 210-211 


NBPM (narrowband phase modulation), 188—189 
near-far problem, 693 
net area, 35 
networking, 537-541 
NIST, 598, 600 
noise, 4-5, 371, 430-434. See also AWGN 
(additive white gaussian noise) 
additive, 382-383, 726 
analog modulation and, 383-386, 397-434 
bandpass, 398—406 
baseband signal and, 381--391 
binary error and, 448—453 
BSCs and, 721 
CDs and, 523 
correlative coding and, 470-476 
CW systems comparison and, 422-424 
decoding of, 505-506 
DSS and, 672-679 
equalization and, 467-468 
equivalent bandwidth and, 378-380 
exponential CW modulation and, 412-422 
figure, 264 
filtered, 375—378 (see also filters) 
granular, 514 
independent additive, 726 
irrelevant, 742 
linear CW modulation and, 406-412 
margin, 442-443 
M-ary error and, 457-461 
mathematics and, 13 
modulation and, 8, 10 
Nyquist pulse shaping and, 461-464 
PCM and, 497, 504—510 
phase-lock loop performance and, 425—426 
postdetection, 412—415 
predetection, 399—400 
PN (pseudonoise), 482-485, 679—692 
pulse measurement and, 386-388 
quantization and, 499—501 
quieting, 415 
repeaters and, 453—454 
scrambling and, 479—483 
signal ratio and, 382-383 
sinusoid envelope and, 634—636 
synchronization and, 476-485 
thermal, 5, 372-375 
white, 375-378, 380-381 
noncoherent binary systems 
bandpass noise and, 634—636 
FSK and, 638-640 
OOK and, 636—638 
PSK and, 640—644 
nonlinear distortion, 97-99, 205-207 
nonlinear signal compression, 502 
nonperiodic energy signal, 34 : 
normalization, 24 
notch filters, 109 
NRZ (nonreturn-to-zero) format, 438, 614, 
685 
NRZI (nonreturn-to-zero inverse), 524 
NTSC (National Television System Commit- 
tee), 287-288, 291, 299-300, 302 _ 
Nyquist, Harry, 12, 372, 441-442, 698 
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Nyquist pulse shaping, 461—464 
AM with digital CW modulation and, 616 
cerrelative coding and, 471—472 
digital CW modulation and, 612—613 
Nyquist rate, 235-236, 243, 519 


OC-N signal, 534 

octets, 534 

odd-harmonic distortion, 212 

odd symmetry, 37-38 

on-off waveform, 438 

OOK (on-off keying), 614 
coherent, 631-632 
noncoherent, 634, 636-638 
systems comparison and, 653-655 

optical communications, 14 

optimum digital detection, 626—631 
error probabilities and, 747—751 
MAP receivers and, 741—747 
signal selection and, 751—754 
signal space and, 735-740 

OQPSK (offset quadriphase phase-shift 

keying), 618 

OR gate, 474, 563, 600 

orthogonal signaling, 751—754 

orthonormal basis functions, 738—740 

oscillators, 117, 208-209 

oscilloscopes, 243-245 

OSI (Open Systems Interconnection), 538-539 

oversampling, 243 

overshoot, 117 


packet switching, 536 
PAM (pulse-amplitude modulation), 7, 11, 
245-248 ` 
digital, 437-448 
formats of, 438 
Nyquist shaping and, 461—464 
optiumum terminal filters and, 464—467 
precoding and, 446-448 
spectra of, 443-448 
transmission limitations and, 440—443 
parabolic function, 414 
parallel connection, 86 
parity check, 549-550, 564, 590 
Parseval's theorem, 31-32 
passband, 84, 109 
pattern recognition, 128-129 
P-box, 597 
PCC (parallel concatenated codes), 592-594 
PCM (pulse-code modulation) 
analog modulation and, 508-510 
CDs and, 523-526 
channel bank and, 529-530 
error threshold and, 507—508 
generation and reconstruction, 495-499 
ideal communication and, 728 
with noise, 504—510 
quantization, 499-504 
systems comparisons and, 733-735 
PCS (personal communications systems), 693 
PDF (probability density function), 325, 636 
binary error and, 449, 451 


continuous channels and, 725-727 
MAP receivers and, 744 
marginal, 330 
M-ary error and, 458 
M-ary PSK systems and, 646-650 
uniform, 326-327 
PDM (pulse-duration modulation), 248-251, 
428 
periodgrams, 364 
periodic modulation, 196-199 
periodic signals, 23-24 
periodic triangular function, 213 
periodic waves, 197 
permutation, 597 
Pg (process gain), 677 
phase 
angle, 19 
bandpass noise and, 403—404 
BPSK and, 617 
comparison detection, 640 
delay, 92 
distortion, 91 
jitter, 425-426 
PRK and, 617 
phase shift, 91 
discriminators and, 217 
double-sideband AM, 152—158 
frequency response and, 82 
maximally linear, 114 
PM and, 617-619 
random signals and, 355—356 
SSB generation and, 168 
triangular wave FM and, 213 
phasors, 19—23 
Correlation of, 126—127 
frequency response and, 62, 81 
interference and, 219 
Steady state response, 83 
tone modulation and, 157—158, 190, 194 
Pierce, J. R., 13 
pilot carrier, 175 
Pinto, Joao, 299n 
pits, 522 
Planck constant, 372 
PLL (phase-lock loop), 175 
bit synchronization and, 477 
color signals and, 298 
DSS and, 498 
frequency synthesizers and, 281—284 
linearized, 285—286 
lock-in and, 278-281 
M-ary PSK systems and, 649 
noise and, 425—426 
spectrum, 25, 28 
synchronous detection and, 281—284 
trackers, 302 
PM (phase modulation), 6 
deemphasis filter and, 222 
digital CW modulation and, 617-619 
exponential CW and, 184—199, 412-422 
indirect FM and, 209-212 
interference and, 220 
linear distortion and, 202-205 
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narrowband, 188—189 
S/N (signal-to-noise) ratio and, 416-417 
systems comparison and, 423 
tone modulation and, 189-196 
PN (pseudonoise), 482-485 
coding and, 684—689 
DSS and, 673—679 
FH-SS and, 679-684 
ranging and, 688-689 
synchronization and, 689-692 
POH (path overhead), 535 
point-to-point communication, 422 
Poisson distribution, 338—339 
polar signal, 438, 452 
postdetection noise, 412—415 
POT (plain old telephone), 530-531 
power 
amplifier, 212 
average, 23-24, 32, 153, 444 ` 
Correlation of, 124—127 
entropy and, 724—725 
gain, 99—101 
jamming, 676 
noise and, 372-375, 378 
peak envelope, 155-156 
per sideband, 156 
predetection noise and, 400 
quadrature components and, 402 
random signals and, 362-367 
spectral density and, 132 
superposition and, 32, 367-368 
PPM (pulse-position modulation), 248—250 
orthogonal signaling and, 752 
S/N (signal-to-noise) ratio and, 428 
spectral analysis and, 251—253 
systems comparisons and, 733—735 
practical sampling, 240-245 
preamble, 690 
precoding, 446—448, 471, 474. See also coding 
precursors, 110, 117 
predetection noise, 399—400 
prediction error, 510-511 
prediction gain, 519 
predictive coding, 709—713 
preemphasis filtering, 221—223 
preferred pairs, 687 
prefix, 484 
PRI (primary rate interface), 532 
principle of superposition, 77-80 
PRK (phase-reversal keying), 617 
probability. See also coding; information theory 
APP, 593-594 
coding and, 720-721 
conditional, 317-320, 330 
discrete channels and, 713-722 
DSS and, 676 
forwerd transition, 714 
jeint, 329-330 
letter occurence and, 709—710 
M-ary, 457-461 
models for, 337-345 
optimum digital detection and, 747-751 
random variables and, 320-330 
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probability—Cont. 
Rayleigh curve and, 642 
Rician distribution and, 642 
sample space and, 312-320 
statistical averages and, 330—337 
three fundamental axioms of, 315 
word transmission, 549, 557 
product modulators, 158-160 
projection, 737 
pseudo-trinary inversion, 438-439 
PSK (phase-shift keying), 612-613 
DSS and, 673 
M-ary systems and, 646—653 
noncoherent, 634, 640-644 
systems comparison and, 653-655 
TCM and, 655-665 
PTE (path terminating element), 533-534 
public key encryption, 596 
pulses ` 
analog modulation and, 426-429 
chipped, 673—674, 676 
false, 429 
filters and, 388-391 (see also filters) 
noise and, 386—388 
Nyquist shaping and, 461-464 
raised cosine, 68-70 
synchronization and, 442, 476-485 
pulse-time modulation, 248—253 
puncturing, 593 
PWM (pulse-width modulation), 248 


QAM (quadrature amplitude modulation), 145, 
190, 271 
bandpass digital transmission and, 644—646 
digital CW and, 615-618 
M-ary systems and, 650—653 
systems comparison and, 653-655 
QPSK (quadri phase-shift keying), 618, 
644—646 
TCM and, 655-665 
quadrature detector, 217 
quadrature filters, 120—123 
bandpass noise and, 401—402 
exponential CW with noise and, 414 
quantization, 495—496 
DPCM and, 519 
noise and, 499—501 
nonuniform, 501—504 
quasi-static approximation, 204, 251 
quaternary signal, 439—440 


radar, 13 
radian frequency, 19 
radios, 2, 5, 11-13 
parameters of, 259 
transmission loss and, 106-109 
raised cosine pulse, 68—70 
ramp generator, 250, 265 
random signals, 444 
binary error probabilities and, 448-449 
CDs and, 525 
coding and, 720-721 


correlation functions and, 353-356 
ensemble averages and, 353-356 
ergodic processes and, 357-362 
filtered, 368-371 
modulation and, 367-368 
power spectrum and, 362—367 
Stationary processes and, 358—362 
superposition and, 367—368 

ranging, 688—689 

raster lines, 290 

rate distribution theory, 735 

rate reduction, 712 

ratio detector, 217 

Rayleigh curve, 636, 642 

Rayleigh PDF, 342-344 

Rayleigh's theorem, 40-42, 82 

real symmetry, 38 

receivers, 4 
direct conversion, 262 
monochrome, 292—294 
scanning spectrum analyzers, 265-266 
special-purpose, 262-264 
specifications of, 264-265 
superhetrodyne, 258-262 

reciprocal scale change, 46 

reciprocal spreading, 36, 66—67 

reconstruction, 7, 237—240 

recording, 522-526 

rectangular pulse train, 27-30 
Hilbert transform and, 122-123 

redundancy, 710 

Reeves, Alec, 13 

regeneration, 440, 450 

rejection filter, 302 

relative frequency of occurrence, 313 

repeaters, 60, 549—550 
Internet and, 540-541 
regenerative, 453-454 
transmission loss and, 101—102 

resistance noise, 372 

resolution, 287-291 

RF (radio-frequency), 209, 259-260 

RF choke, 209 

RFI (radio-frequency interference), 4 

Rician distribution, 635, 642 

ring modulators, 161-162 

risetime, 118—120 

RLC load, 162 

rms bandwidth, 514 

rolloff, 463, 615, 623 

routers, 540 

RS (Reed-Solomon) codes, 301, 573 

RSA (Rivest-Shamir-Adleman) system, 594, 

602-603 
RSC (recuisive systematic convolutional) 
encoders, 592-594 

run, 424 

r.v. (random variables) 
conditional probability and, 330 
continuous, 323—327 
discrete, 320-323 
jeint probability and, 329—330 


statistical averages and, 330—337 
transformations of, 327—329 
RZ (return-to-zero) format, 438, 477 


sampling, 7, 60 
chopper, 232-237 
function, 26, 353 
ideal, 237—240 
oscilloscopes, 243-244 
PAM and, 245-248 
practical, 240-245 
S/H (sample-and-hold) technique, 245—246 
space, 313—317 
satellites, 13, 107—109, 268-269 
SAW bandpass filter, 261 
S-box, 597 
SCA (Subsidiary Communication 
Authorization), 270 
Scalar products, 124, 736 
scale change, 46-48 
scanning spectrum analyzer, 265-266 
scattering, 105 
Schwarz’s inequality, 124, 389, 736—737 
scrambling, 479—483 
SDH (Synchronous Digital Hierarchy), 533 
SDSL (symmetrical digital subscriber line), 532 
secret key encryption, 595 
selective-repeat scheme, 557 
selectivity, 264 
self-information, 700 
sensitivity, 264 
sequential decoding, 588—592 
Shannon, Claude, 13, 698. See also information 
theory 
fundamental theorem for noisy channel, 
711—718 
Source coding theorem, 706 
Shannon-Fano coding, 708-709 
shift register, 479-483 
decoding and, 585-592 
frame synchronization and, 484—485 
Gold codes and, 686—688 
scrambling and, 479—483 
Sideband reversal, 261 
sidelobes, 619 
signals, 3, 136-139. See also coding, 
modulation 
average power and, 23-24 
bandpass frequency response, 142-151 
bipolar, 438, 447 
classes and, 18 
constellation and, 616 
correlation and, 124—130 
DSS, 672-679 
duobinary, 472, 474-476 
filters and, 109—123 (see also filters) 
Fourier and, 25-32 (see also Fourier series; 
Fourier transforms) 
Gram-Schmidt procedure and, 
738-740 
LT1 system response and, 76-88 
MAP receivers and, 741—747 








multiplexing and, 526-537 (see also 
multiplexing) 
nonlinear compression and, 502 
orthogonal, 751-754 
partial response, 470-476 
periodic, 23-24 
polar, 438, 452 
random, 352-371 (see also random signals) 
rate, 438 
sampling and, 232—245 (see also sampling) 
scale change and, 46—48 
shape of, 4 
space and, 735—740 
spectra and, 17-74, 130-135 (see also 
spectra) 
strength and, 4 
synchronization and, 476-485 
television, 287-291 
time delay and, 45-46 
transmission and, 88—109 
unipolar, 448, 452 
as vectors, 735-738 
sign inversions, 438-439 
signum functions, 64—66 
sinc pulse, 43, 442 
sine function, 20, 26 
sinusoids 
bandpass noise and, 634—636 
carrier waves and, 6 
correlation of, 126-127 
interference and, 219-221, 676-678 
linear CW modulation, 141-182 
random signals and, 355-356 
sampling and, 234 
thermal noise and, 374 
sliding-correlator, 691 
slope detection, 216 
slope loading factor, 514 
slope overload, 513, 515 
slow hop SS, 680-681 
smart electronics, 11-12 
smoothing, 54 
S/N (signal-to-noise) ratio, 264, 382-383. See 
also noise 
analog pulse and, 426-429 
coding and, 548 
continuous channels and, 722-731 
destination, 416—417 
exponential CW with noise and, 412-422 
M-ary error and, 459 
power and, 5 
predetection and, 401 
repeaters and, 453-454 
synchronous detection and, 407-409 
TCM and, 655 
soft decision, 664—665, 731 
SONET (Synchronous Optical NETwork), 
533-535 
SPE (synchronous payload envelope), 535 
special-purpose receivers, 262-264 
spectra, 5, 9 
AM with digital CW modulation, 615 


available density and, 374 
bandlimited digital PAM, 462 
continuous, 33—44 
convergence conditions and, 29 
convolution and, 52-58 
cosine rolloff, 463 
cross density and, 367 
density functions, 130-135 
digital PAM, 443-448 
direct sequence spread, 672-679 
DSB signals and, 154—156 
Fourier series and, 25-32 
Fourier transforms and, 33-44 
frequency hop spread, 679—684 
Gibbs phenomenon arid, 31 
impulses, 58—70 
line, 19-32 
optimum terminal filters and, 465 
Parseval’s theorem and, 31-32 
periodic signals and, 23-24 
phasors and, 19-23 
PPM and, 251-253 
precoding and, 446-448 
PSK, 618 
pseudonoise coding and, 684—689 
quadrature components and, 402 
random signals and, 362—367 
Scanning analyzer, 265—266 
spread systems and, 671—695 
SSB signals and, 164-167 
superposition and, 367—368 
synchronization and, 689—692 
television and, 286-303 
time/frequency relations, 44—52 
tone modulation and, 192—195 
VSB signals and, 170-172 
wireless telephone systems and, 692-693 
speech synthesis, 520—522 
speed, 5 
Spikes, 418 
split phase Manchester format, 439 
square integrable condition, 29 
square-law modulators, 160—161 
SSB (single-sideband modulation) 
direct conversion receivers and, 262 
frequency synthesizers and, 284 
generation, 167-169 
interference and, 220 
multiplexing and, 274 
reversal and, 261 
spectra and, 164—167 
synchronous detection and, 408—409 
systems comparisons and, 733-735 
stability, 8! 
standard deviation, 332-333 
state diagram, 576-577, 581—582 
state variable, 581 
stationary processes, 352, 357—362 
Statistics 
averages and, 330—337 
characteristic functions, 336-337 
Chebyshev's inequality, 332-333 
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expectations, 331, 334—336 

independence, 318—319 

mean, 331, 335-336 

moment, 331 

standard deviation, 332~333 

time division multiplexing, 536 
step functions, 64—66, 78 
Stirling's approximation, 722 
Stochastic processes, 352 
stop-and-wait scheme, 557 
Stopband, 109 
store and forward switching, 536 
STS (synchronous transport signal), 534 
stuff bits, 528 
Sunde's FSK, 620, 623 
superhetrodyne receivers, 258-262 
superposition, 45 

average power, 32 

frequency reponse and, 81 

integral, 77-80 

noise and, 382 

random signals and, 367-368 
surround sound, 300 
switched capacitor filter, 115 
switching function, 162-163, 210, 233 
SX (simplex) transmission, 4 
symmetry 

binary channels and, 716—722 

even/odd, 37-38 

hermitian, 35 

real, 38 

vestigial, 462 
synchronization, 476 

acquisition and, 689—691 

bit, 477-478 

coherent binary systems and, 633—634 

detection, 172-176, 406—409 

frame, 484—485 

Nyquist pulse shaping and, 463 

PLL and, 278—286 

PPM and, 250 

scrambling and, 479-483 

tracking and, 691-692 

transmission limitations and, 440-443 
syndrome decoding, 564—567, 590 
synthesizing, 520-522 


table-lookup decoding, 565—566 
tangential sensitivity, 429 
tank circuit, 162 
tapped-delay-line equalizer, 94 
Tau-Dither loop, 691—692 
TCM (trellis-coded modulation), 301, 576—577, 
623, 655 
basics of, 656-664 
hard/soft decisions and, 664—665 
modems and, 665 
TCP/IP (Transmission Control 
Protocol/Internet Protocol), 539-541 
TDM (time-division multiplexing), 10, 
272-278 
TDMA (time-division multiple access), 108 
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telegraphs, 2, 11, 441 
random signals and, 365—367, 370 
telephones, 2, 13, 692 
cellular, 13, 693 
multiplexing and, 10, 266 
teletypewriter, 13 
thermal noise, 5, 372-376 
Thevenin model, 373-374 
threshold effect 
false-pulse, 429 
frame synchronization and, 485 
FM (frequency modulation) and, 418-422 
linear CW with noise and, 411-412 
M-ary error and, 458 
optimum digital detection and, 754 
time 
complex modulation and, 48-50 
convolution and, 52-58 
delay, 45-48, 91-92, 105, 175 
DLL, 691-692 
duality theorem and, 42-43 
first-order system response and, 79-80 
Fourier series and, 25-32 
Fourier transforms and, 33-44 
frequency and, 44~52 
impulses and, 66-68 
limiting, 33, 111-112 
position error, 387 
sampling and, 232-245 
scale change, 4648 
superposition and, 77 
symmetry and, 37-38 
TOH (transport overhead), 535 
tone modulation, 143, 157-158 
exponential CW and, 189-196 
total instantaneous angle, 185 
tracking, 691-692 
trailing edge modulation, 249 
transducers, 3 
transfer functions 
correlative coding and, 472 
frequency response and, 80-85 
transform functions. See also Fourier transforms 
block diagram analysis and, 85-88 
distortion and, 88-99 
filters and, 109-120 
Hilbert, 120-123 
transforms in the limit, 69-70 
in frequency, 61-64 
impulse properties and, 58-61 
step and signum functions, 64-66 
in time, 66-68 
transistors, 13 


transmission. See also modulation; signals 
analog, 383-386 
bandwidth and, 5, 199-202 
baseband pulse with noise, 381—391 
channel capacity and, 3, 6 
distortion and, 84, 88-99 
fiber optics and, 102-106 
full-duplex, 4-5 
half-duplex, 5 
limitations in digital, 440-443 
linear distortion and, 202-205 
loss, 99-109 
monochrome, 292~294 
nonlinear distortion and, 205-207 
simplex, 4 

transponders, 172-173 

transversal filter, 94, 519 

trapezoidal pulse, 56 

tree code, 707 

TRF (tuned-RF), 262 

triangular function, 51-52, 736 

triangular wave FM, 212-214 

triple DES, 601 

tuning ratio, 261 

turbo codes, 592-594 

TV (television), 2, 11, 13 
bandwidth and, 5 
color, 294-299 
digital, 14 
high definition, 288, 299-303 
monochrome, 292-294 
multiplexing and, 10 
signal properties and, 287~291 

twinned binary format, 439 


UDP (user datagram protocol), 541 
union bond, 750 

union event, 315 

unipolar signals, 448, 452 
uniquely decipherable, 705 

unit impulse. See impulses 

unit step, 67 

up-sampling, 526 


variable-reactance element, 208 
variable transconductance multiplier, 
159 
variance, 332 
VCC (voltage-controlled clock), 477—478 
VCO (voltage-controlled oscillators), 208—209, 
212 
M-ary PSK systems and, 649 
spectrum analyzers and, 265 


threshold effect and, 421-422 
tracking and, 691—692 
VDSL (very-high bit rate digital subscriber 
line), 532 
vectors, 552-553 
block codes and, 560-564 
MAP receivers and, 741-747 
optiumum digital detection and, 740-754 
signals as, 735-740 
signal selection and, 751-754 
syndrome decoding and, 570-573 
weight, 560-561 
vestigial symmetry theorem, 462 
video, 287-291 
VIR (vertical-interval reference), 204 
virtual circuit, 536 
Viterbi algorithm, 586-589, 658 
VITS (vertical-interval test signal), 294 
VLSI technology, 243 
vocoders, 521 
voltage-tunable bandpass amplifier, 265 
VSB (vestigial sideband modulation), 164 
AM with digital CW modulation and, 616 
8VSB for HDTV and, 302 
spectra and, 170-172 
synchronous detection and, 408 
systems comparison and, 422-424 
VT (virtual tributary), 534 


WAN (wide area networks), 538 
waveform encoders, 521 
waves. See signals 
weighted-resistor decoder, 498 
white noise, 375—378, 380—381. See aiso 
AWGN (additive white gaussian noise) 
CDs and, 523 
DSS and, 675 
matched filtering and, 454—457 
mean square value reduction of, 441 
PN (pseudonoise) and, 482 
wideband noise reduction, 8, 10, 416—417 
Wiener-Kinchine theorem, 131-132, 362-363 
wireless phones, 693 
Wolfowitz, J., 592 
word error, 549 
word transmission probability, 557 
WSS (wide-sense stationary) process, 358 


zero crossing, 187, 218, 442 

bit synchronization and, 477-478 
zero-forcing equalizer, 469 
ZOH (zero-order hold), 87-88, 240-241 


